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CASPR/pl»0, A FOHCTIOIOI. LIGMID FOR RPTP-BBTA AHD THB WtONM. 

GEI,!. RBCOgMITIOK yfQI.ECOI.B COKTACTXM 

The present application claims priority under 35 U.S.C. 
S 119(e) to provisional application serial No. 60/014,199, 
filed March 21, 1996, the entire contents of which is 
incorporated herein by reference in its entirety. 

1. IMTROPDCTIOH 

The present invention relates to the 190 Kd neuronal 
protein (hereinafter "pigO" , -CASPR" or «CASPR/pl90« ) that 
interacts with contactin, and with the carbonic anhydrase 
("CAH") domain of the receptor-type tyrosine phosphatase 
RPTP-/5, specific peptides thereof and nucleic acid molecules 
encoding such pl90 proteins and peptides. The protein is 
also referred to as CASPR, for Contactin Associated PRotein. 
The CAH domain of RPTP/S has previously been identified as a 
ligand for contactin, and the binding of the CAH domain of 
RPTP/S to the contactin on neural cells results in neurite 
growth, differentiation and survival. CASPR/pl90 has been 
identified as a potential bridge that couples contactin, a 
GPI-linJced protein, with intracellular second messenger 
sys1:ems. The invention also relates to compounds that mimic, 
enhance, or suppress the effects of pl90, including those 
molecules which act downstream in the signal transduction 
pathway that results from the binding of the ligand to 
contactin. In addition, the invention also relates to the 
use of such compounds to treat neurologic diseases including 
those characterized by insufficient, aberrant, or excessive 
neurite growth, differentiation or survival. 

30 

2. BAGKGROnWD OF T HE INVENTION 

The ability of cells to respond to signals from their 
microenvironment is a fundamental feature of development. In 
the developing nervous system, neurons migrate and extend 
axons to establish their intricate network of synaptic 
connections (Goodman and Shatz, 1993, Cell/Neuron (suppl.), 

- 1 - 
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72/10:77-98) . During migration and axonal pathfinding, cells 
are guided by both attractive and repulsive signals (Hynes 
and Lander, 1992, Cell, 68:303-322; Keynes and Cook, 1992, 
Lurr. Opin. Neurobiol., 2:55-59)- The ability of the neuron 
5 to respond to these signals requires cell surface molecules 
that are able to receive the signal and to transmit it to the 
cell interior resulting in specific biological responses. 

It is well established that protein tyrosine 
phosphorylation is responsible for the regulation of many 
10 cellular responses to external stimuli crucial for cell 

growth, proliferation and differentiation (Schlessinger and 
Ullrich, 1992, Neuron, 9:383-391)- Tyrosine phosphorylation 
has been implicated in several developmental processes in the 
nervous system. For example, receptor tyrosine kinases were 
15 shown to effect neuronal survival (Chao, 1992, Neuron, 9:583- 
593), and cell fate determination (Zipursky and Rubin, 1994, 
Annu. Rev. Neurosci., 17:373-397). Non-receptor tyrosine 
kinases have been shown to be downstream elements in 
signaling via cell recognition molecules that play a role in 
20 cell guidance and migration (Ignelzi et al., 1994, Neuron, 
12:873-884; Umemori et al., 1994, Nature, 367-572-586). 

The transient nature of signaling by phosphorylation 
requires specific phosphatases for control and regulation 
(Hunter, 1995, Cell, 80:225-236). Indeed, many protein 
25 tyrosine phosphatases have been shown to be expressed in 
specific regions of the developing brain, including the 
olfactory neuroepithelixim (Walton et al., 1993, Neuron, 
11:387-400), the cortex (Sahin et al., 1995, J. Comp. 
Neurol., 351:617-631), and in retinal MUller glia (Shock et 
30 al., 1995, Mol. Brain Res., 28:110-116). Furthermore, 

expression of several tyrosine phosphatases, such as FTPa 
(den Hertog et al., 1993, EMBO J., 12:3789-3798), PC12-PTP1 
(Sharama and Lombrosc, 1995, J. Biol. Chem. , 270:49-53) and 
several forms of LAR (Zhang and Longo, 1995, J. Cell. Biol., 
35 128:415-431) have been found to be regulated during neural 
differentiation of P19 or PC12 cells. 

- 2 - 
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Receptor-type tyrosine phosphatases (RPTPs) have been 
subdivided into several groups based on structural 
characteristics of their extracellular domains (Charbonneau 
and Tonks, 1992, Annu. Rev. Cell Biol., 8:463-493; Barnea et 
5 al., 1993, Mol. Cell. Biol., 13:1497-1506). RPTP^/f and 
RPTP7 are members of a distinct group of phosphatases, 
characterized by the presence of a carbonic anhydrase-like 
domains (CAH) , f ibronectin type III repeats (FNIII) , and a 
long cysteine free region (spacer domain) in their 

10 extracellular domain (Barnea et al. , 1993, Mol. Cell. 
13:1497-1506; Krueger et al., 1992, Proc. Natl. Acad. 
USA, 89:7417-7421; Levy et al., 1993, J. Biol. Chem., 
268:10573-10581). The expression of RPTP^ is restricted to 
the central and peripheral nervous system, while RPTP7 is 

15 expressed both in the developing nervous system, as well as, 
in a variety of other tissues in adult rat (Canoll et al., 

1993, Dev. Brain Res., 75:293-298; Barnea et al., 1993, Mol. 
Cell. Biol., 13:1497-1506). RPTP/3 exists in three forms, one 
secreted form and two membrane bound forms, that differ by 

20 the absence of 860 residues from the spacer domain (Levy et 
al., 1993, J. Biol. Chem., 268:1053-10582; Maurel et al., 

1994, Proc. Natl. Acad. Sci. USA, 91:2512-2516). The 
secreted form has been identified as a chondroitin sulfate 
proteoglycan from rat brain called phosphocan (3F8 

25 proteoglycan) (Barnea et al., 1994, Cell, 76:205; Maurel et 
al., 1994, Proc. Natl. Acad. Sci. USA, 91:2512-2516; Shitara 
et al., 1994, J. Biol. Chem. 269:20189-20193). The 
transmembrane form has also been sho%m to be expressed in a 
form of a chondroitin sulfate proteoglycan (Barnea et al., 

30 1994, J. Biol. Chem., 269:14349-14352). Purified phosphocan 
can interact in vitro with the extracellular matrix protein 
tenascin, and with the adhesion molecules, N-CAM and Ng-CAM 
(Barnea et al., 1994, J. Biol. Chem., 269:14349-14352; Grumet 
et al., 1993, J. Cell. Biol., 120:815-824; Grumet et al. , 

35 1994, J. Biol. Chem., 269:12142-12146; Milev et al., 1994, J. 
Cell. Biol., 127:2512-2516). 



- 3 - 
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3. SUMMARY OF THE I NVENTION 

The present invention relates to the 190 Kd neuronal 
protein (hereinafter "pl90", "CASPR" or "CASPR/pl90") that 
interacts with contacting and with the carbonic anhydrase 
5 (••CAH") domain of the receptor-type tyrosine phosphatase 
RPTP-^, specific peptides thereof and nucleic acid molecules 
encoding such pl90 proteins and peptides. 

The invention further relates to the use of pl90 and 
related compounds to treat neurologic diseases including 
10 those characterized by insufficient, aberrant, or excessive 
neurite growth, differentiation or survival* More 
specifically, the invention relates to the use of compounds 
that mimic, enhance or suppress the effects of pl90 on 
neurite growth, differentiation and survival. 
15 The invention is based, in part, on the discovery that 

the CAH domain of RPTPiS (RPTP/S-CAH) is the ligand for 
contactin and that its binding results in neurite growth, 
differentiation and survival, and on the further discovery 
that pl90 acts as the bridge between contactin and 
20 intracellular second messenger systems. 

In the examples described infirm, it is shown that 
receptor phosphatase RPTP/S specifically interacts with two 
ligands, one on the surface of glial cells, and the other on 
the surface of neuronal cells. Using expression cloning in 
25 COS7 cells and bioaffinity purification, the neuronal ligand 
was identified to be the rat homologue of the cell 
recognition molecule contactin (F11/F3) . Using combinations 
of soluble and membrane bound forms of RPTPiS and contactin it 
is demonstrated that the reciprocal interaction between the 
30 two molecules is mediated by the CAH domain of the 

phosphatase. Moreover, it is found that when used as a 
substrate, the CAH domain of RPTP)5 induced neurite growth, 
differentiation and survival of primary neurons and IMR-32 
neuroblastoma cells. Using antibody perturbation 
35 experiments, the contactin ligand was found to be a neuronal 
receptor for the CAH domain of RPTP0. The data indicate that 

• 4 - 
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molecule 



and 
5 the 
e 



the interaction 



between contactin, a cell recognition 
and KPTB0, a transmembrane protein tyrosine 

plays an important role in neuronal development 
ion. As explained more fully in Section 5,2, 
experiments of the examples were conducted to 
the interaction between contact in and intracellular 
systems- Binding experiments revealed that 
between pl90 and contact in is important for 
of contact in and RPTP/S-CAH in neuronal growth. 



10 development and differentiation. 



3.1 



DEglNITIONB 



As used herein, the following terms and abbreviations 
shall have the meanings indicated below: 



Table 1 



20 



25 



30 



base pair(s) 



carbonic anhydrase 
complementary DNA 
counts per minute 



domain of RPTP/3 



id 
I 



g ly cosy 1-phosphatedy 1 inos i to 1 
kilobase pairs 
kilodation 



polyacrylamide gel e 
polymerase chain reaction 
receptor type tyrosine phosphatase beta 
ribonucleic acid 
sodium dodecyl sulfate 



bp 

CAH 

RPTP/S 

CDNA 

cpm 

DNA 

FNIII 



kb 
kDa 

Mg 

ng 
ran 
nt 



PAGE 



RPTPjS 
RNA 
SDS 
u 



35 
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Ab used herein, the word "modulate" shall have its usual 
meaning/ but shall also encompass the meanings of the words 
enhance / inhibit, and mimic. In addition, as used herein, 
the word "express ion , when used in connection with a gene 
5 such as pl90, shall have its usual meaning, but shall also 
encompass the transcription of the gene, the longevity of the 
functional mRNA transcribed from the gene, the translation of 
that mRNA, and the activity of the gene product. 

10 4. DESCRIPTION OF T HE DRAWINGS 

FIG. 1 shows the homology between human and rat 
CASPR/pl90 proteins. Also shown are the important domains of 
the proteins as more fully described infra at Section 5.3. 

5. DETAIIiED DESCRIPTION OF THE INVBNTJ:ON_ 



A large group of protein tyrosine phosphatases have 
structural characteristics suggesting that they function as 
cell surface receptors. Receptor type tyrosine phosphatase 0 
(RPTP/J) is expressed in the developing nervous system and it 

20 contains a carbonic anhydrase (CAH) domain as well as a 
fibronectin type III (FNIII) repeat in its extracellular 
domain. A variety of experiments were conducted to search 
for ligands of RPTPp. These experiments led to the 
surprising recognition that the CAH domain of RPTP/S is a 

25 functional ligand for contactin, a GPI-membrane anchored 
neuronal cell recognition molecule that functions as a 
receptor on neurons. The CAH domain of RPTP/J (RPTP/3-CAH) 
induces cell adhesion and neurite growth of primary tectal 
neurons, and differentiation of neuroblastoma cells. Further 

30 experiments led to the recognition that the interaction 
between pl90 and contactin is important in mediating the 
effects of contactin and RPTP/3-CAH. The assays of the 
invention identify compounds that mimic, enhance, or inhibit 
the pl90 mediated effects of contactin/RPTP/3-CAH on neural 

35 cells including, but not limited to, agonists and antagonists 
of contact in/RPTPiS-CAH. Therapeutic uses of compounds so 
identified are also provided. The invention is described in 

- 6 - 
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det^ail in the following subsections and examples for purposes 
of clarity and not by way of limitation. 

5.1. BIOLOOY OP THE IMTBRACTION BETWEEN 
f>f>WP1iCT TM AND THE CAH DOM AIN OP RP' 

5 

During development of the nervous systems, neurons are 

guided by secreted and cell bound molecules that provide both 

negative and positive cues. The experiments described in the 

examples of Sections 6.1 and 6.2 show that RPTP/3, a receptor 

type protein tyrosine phosphatase, may provide such a signal 

xo 

by interacting with the axonal recognition molecule 
contactin- RPTP)S is a developmental ly regulated protein that 
exists in three forms, one secreted and two membrane bound. 
The extracellular region of RPTPjS has a multidomain structure 
consisting of a CAH-like domain, a single FNIII repeat, and a 
long cysteine free spacer region. The complex structural 
nature of its extracellular region may result in a 
multifunctional protein that is able to interact with 
different proteins. As dociimented by the data shown herein, 
the CAH and the FNIII domains bind to at least two potential 

20 

ligands present on neurons or glial cells. Functional 
expression cloning in COS? cells and affinity purification 
with a specific affinity matrix followed by microsequencing 
enabled unequivocal identification of the cell recognition 
molecule contactin (F3/F11) as a neuronal ligand of RPTP^. 
The interaction between contactin and RPTP/S is mediated via 
the CAH domain of the phosphatase, while the FNIII domain 
appears to bind to another molecule expressed on the surface 
of glial cells. It was previously shown that the secreted 
proteoglycan form of RPTP/3 interacts with tenascin, N-CAM and 
Ng-CAM (Grumet et al. , 1994, J. Biol. Chem. , 269:12142-12146; 
Barnea et al., 1994, J. Biol. Chem., 269:14349-14352; Grumet 
et al., 1993, J. Cell. Biol., 120:815-724; Milev et al., 
1994, J. Cell. Biol., 127:1703-1715). Since N-CAM and Ng-CAM 
do not bind directly to the CAH or the FNIII domain of RPTP/?, 
they may interact with the large spacer domain of the 
phosphatase. Alternatively, they could interact with RPTP^ 

- 7 - 
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1:hrough a tihird componen't. Con^act.in may fulfill this 
function since it has been shown to interact with Ng^CAH, Nr- 
CAN, and the matrix proteins tenascin and restriction 
(Brtimmendorf et al. ^ 1993, Neuron, 10:711-727; Morales et 
5 al., 1993, Neuron, 11:1113-1122; Zisch et al., 1992, J. Cell* 
Biol*, 119:203-213). The various subdomains of the 
extracellular region of RPTPiS are able to interact with 
several distinct proteins that are expressed on diverse cell 
types in the central nervous system. 

10 In contrast to other cell recognition molecules that are 

widely expressed in the nervous system, members of the 
contactin subgroup appear to be expressed in a restricted 
manner on specific axons during development (Dodd et al., 
1988, Neuron, 1:105-116; Faivre-Sarrailh et al., 1992, J. 

15 Neurosci., 12:257-267). The spatial and temporal expression 
pattern of these proteins indicates they play an important 
role during development of the nervous system. Contactin was 
found to be exclusively expressed on neurons during 
development in fiber-rich areas of the retina, tectum, spinal 

20 cord and cerebellum (Ranscht, 1988, J. Cell. Biol, 107:1561- 
1573). It was found to be localized in the postnatal and 
adult mouse cerebellum in axonal extensions of the granule 
cells in the parallel layer (Faivre-Sarrailh et al., 1992, J. 
Neurosci., 12:257-267). This pattern of expression is 

25 overlapping with the expression pattern of RPTP/J in the rat. 
RPTPiS was shown to be expressed in fiber-rich regions such as 
the parallel fibers of the cerebellum and the spinal cord 
(Canoll et al., 1993, Dev. Brain Res., 75:293-298; Milev et 
al., 1994, J. Cell. Biol., 127:1703-1715). RPTP/S is also 

30 expressed on glial and radial glial cells, and its secreted 
form is produced by astrocytes. Therefore, both forms of 
RPTP/3 may modulate neuronal function via interactions with 
contactin. 

The contactin subgroup of glycoproteins all share 
35 structural similarity in that they are, glycosyl- 

phosphat idyl inositol (GP I) -anchored proteins. They also 
exist in soluble forms generated as a result of meiabrane 
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release or by expression of alternative spliced forms 
(BrUmmendorf and Rath j en, 1993, J. Neurochem- , 61: 1207-1219) • 
Differential expression of the membrane-bound and soluble 
forms of contact in was found in the hypothalamus-hypophyseal 
5 system (Rougon et al., 1994, Braz. J. Med. Biol. Res., 2:409- 
414) . RPTP)S also exists in either membrane bound or secreted 
forms that are developmental ly regulated. Therefore, both 
RPTP/3 and contactin may act as either a ligand or a receptor 
for each other. Hence, the classical notion of ligand 

10 receptor interaction does not fully explain this system since 
both components might switch roles at different stages of 
development. For example, the soluble form of RPTP/3 produced 
by glial cells may act as a ligand for the membrane bound 
form of contactin expressed on the surface of neuronal cells. 

15 Conversely, the soluble form of contactin may act as ligand 
for the membrane bound form of RPTPiS expressed on the surface 
of glial cells. Moreover, interaction between the membrane 
bound forms of contactin expressed on the surface of neurons 
with the membrane form of RPTPiS expressed on the surface of 

20 glial cells may lead to bidirectional signals between these 
two cell types. Such complex interactions between the 
various forms of RPTPjS and contactin may generate 
development ally regulated unidirectional and bidirectional 
signals. 

25 While not being limited to any theory or explanation of 

how the invention wor)cs, the following is hypothesized to 
explain how the CAH domain of RPTB/S binds to contactin. 
Carbonic anhydrases are highly efficient enzymes that 
catalyze the hydration of CO,. Yet, the CAH domain of PTPases 

30 were not thought to be endowed with enzymatic activity due to 
substitution of two of the three key histidine residues that 
are essential for enzymatic activity (Barnea et al., 1993, 
Mol. Cell. Biol., 13:1497-1506). In contradistinction, 
highly packed hydrophobic core as well as the hydrophobic 

35 residues that are exposed on the surface of carbonic 
anhydrase structure and which are conserved in the CAH 
domains of RPTP7 and jS may be involved in protein-protein 
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interaction and thus function as a ligand binding domain 
(Barnea et al., 1993, Mol . Cell- Biol., 13:1497-1506). It is 
of note that Vaccinia virus contains a transmembrane protein 
with a CAH-like domain in its extracellular domain, which was 
5 thought to be involved in binding of the virion to host 

proteins (Maa et al,, 1990, J. Biol. Chem. , 265: 1669-1577). 
Therefore, in theory but not by way of limitation, compounds 
exhibiting effects which mimic, enhance, or inhibit the 
contactin mediated effects of RPTP0-CAH on neuronal cells may 
10 do so via other members of the contactin family of 

glycoproteins, and may do so even if lacking in CAH activity. 

A number of models may be proposed for how contactin, a 
GPI-1 inked protein that is inserted into the outer leaflet of 
the plasma membrane, transmits a signal into the cells to 
15 promote neurite outgrowth. In theory, and not by way of 
limitation, one possibility is that contactin is able to 
interact with a transmembrane signaling component. The pl90 
(also referred to as pl80) protein that was coprecipitated 
with contactin has been a candidate for such a signaling 
20 protein. pl90 may be membrane-associated since it may not be 
released by phospholipase C treatment. Another potential 
signal transducer may be Ll/Ng-CAM or a related molecule. 
This transmembrane CAM was shown to interact with contactin 
(Briimmendorf et al. , 1993, Neuron, 10:711-727), and to 
25 initiate second messenger cascade via its cytoplasmic domain 
(Doherty and Walsh, 1994, Curr. Opin. Neurobiol., 4:49-55). 
The best characterized GPI linked signaling protein is the 
ciliary neurotrophic factor receptor (CNTF receptor) ♦ 
Following ligand binding, the CNTFR interacts with the signal 
30 transducer gpl30. The gpl30 protein that is shared by 
several lymphokines and cytokines such as IL-6, LIP and 
Oncostatin, undergoes dimerization followed by recruitment of 
the cytoplasmic Jak protein tyrosine kinases. Stimulation of 
the Jak kinases leads to activation of both the Ras/MAP 
35 kinase and the Stat signaling pathways that relay signals 

from the cell surface to the nucleus. A contactin associated 
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protein such as pl90 may have a function similar to the 

function of gpl30. 

As demonstrated by the examples infra, the binding of 
the CAH domain of RPTP/3 to contactin leads to cell adhesion 
S and neurite outgrowth. It seems unlikely that the induction 
of neurite growth is a default response resulting from cell 
adhesion per se. Neurons were found to adhere to 
extracellular matrix proteins such as tenascin and 
restriction in short term binding assays, but these 
10 substrates did not promote further neurite extension 

(Schachner et al., 1994, Perspect. Dev. Neurobiol., 1:33-41). 
It was recently reported that the FNIII domain of contactin 
is responsible for adhesion, while the neurite promoting 
activity was attributed to the Ig domains (Durbec et al., 
XS 1994, Eur. J. Neuro., 6:461-472). Another study demonstrated 
that contactin can mediate the repulsion of neurons by 
restriction (Pesheva et al. , 1993, Neuron, 10:69-82). Again, 
this effect was proposed to occur in a stepwise manner, first 
an adhesion step that was followed by a signal that was 
20 transduced to the cells leading to retraction." Therefore, in 
light of the results presented herein, it may be that in 
response to different stimuli, the same molecule can transmit 
opposite signals depending on the context or milieu. 
Whatever the mechanism, the results presented here 
25 demonstrate that a receptor type tyrosine phosphatase serves 
as a functional ligand for a GPI-anchored cell adhesion 
molecule . 

Contactin may also serve as a functional ligand for 
RPTPjS. Modulation of phosphatase activity by neuronal 

30 contactin may result in signaling to glial cells. If this 
does occur, this kind of bidirectional flow of information 
should allow the interacting cells to respond guickly to 
local environmental changes during development. Two other 
receptor type tyrosine phosphatases RPTP/x and RPTPk were 

35 shown to mediate cell-cell interaction in a hemophilic manner 
(Brady-Kalany et al., 1993, J. Cell. Biol., 122:961-972; 
Gebbink et al., 1993, J. Biol. Chem. , 268:16101-16104; Sap et 
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al.^ 1994, Mol. Cell. Biol., 14:1-9). However, changes in 
catalytic activity as a result of these interactions could 
not be detected. These phosphatases are joining a growing 
family of proteins that are involved in cellular recognition 
5 that contain intrinsic enzymatic activities, including 

kinases (Dtrk; Pulido et al., 1992), EMBO J., 11:391-404, /8 
subunit of Na*, K*-ATPase (AMOG; Gloor et al., 1990, J. Cell. 
Biol., 109:755-788), and 0 subunit of prolyl 4-hydroxylase 
(cognin; Rao and Hausman, 1993, Proc. Natl. Acad. Sci. USA, 
10 90:2950-2954} . 

In summary, the experiments and data described herein 
demonstrate that RPTPjS is a functional ligand for the GPI- 
anchored cell recognition molecule contact in. The 
interactions between these two proteins is mediated by the 
15 CAH domain of the phosphatase. In addition, the FNIII of 
RPTP/3 repeat is recspiired for interaction with glia cells, 
demonstrating that the multidomain structure of RPTP/S enables 
interactions with different proteins, and indicates that 
other potential ligands may modulate these interactions. 

20 

5.2 BIOI^OGY OP THE Pl 9 0 INTERACTION 

Applicants have discovered that contactin functionally 
interacts with pl90, a novel mammalian protein described 
herein. In light of this information, pl90 may play an 
25 important role as the link between contactin mediated neurite 
growth, differentiation and survival and the intracellular 
second messenger signalling responsible for this contactin 

mediated effect. 

Cell recognition molecules that contain immunoglobulin 
30 (ig)-liJce domains and fibronectin type III repeats (FNIII) 

mediate the interaction of neurons with their local 

environment during development (Edelman et al., 1991, Annu. 

Rev. Biochem., 60:155-190; Rathjen et al., 1991, Semin. 

Neurosci., 3:297-307; Sonderegger et al., 1991, J. Cell. 
35 Biol., 119:1387-1394). Based on structural similarity they 

are subdivided to three groups. The first is represented by 

NCAM that exist in several alternatively spliced forms 
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(Cunningham et al., 1987, science, 236:799-805). The second 
is the Ll/Ng-CAM subgroup that also contains Nr-CAM and 
Neiirofascin (Griimet, 1992, J. Neurosci. Res., 31:1-13). The 
third group contains contactin and its mouse and chicken 
5 homologues F3 and Fll (Ranscht, 1988, J. Cell. Biol., 

107:1561-1573; Brummendorf et al., 1989, Neuron, 2:1351-1361; 
Gennarini et al. , 1989, J. Cell. Biol., 109:755-788; Reid et 
al., 1994, Brain Res. Mol. Brain Res., 21:1-8; Berglund et 
al., 1994, Genomics, 21:571-582), TAG-1 and its chick and 

10 human homologues Axonin 1 and TAX-l (Furley et al. , 1990, 
Cell 61:157-170; Hasler et al., 1993, Eur. J. Biochem. , 
211:329-339; Zuellig et al., 1992, Eur. J. Biochem., 204:453- 
463) and BIG-1 (Yoshihara et al., 1994, Neuron, 13:415-426). 
The glycoproteins from the contactin subgroup are all 

15 glycosylphosphatydylinositol (GPI) -anchored proteins composed 
of six C2 type Ig-like domains and foxir fibronectin type III 
repeats. They can also be found as secreted proteins as a 
result of membrane release and shedding or by alternative 
splicing that generate soluble forms (Brttomendorf et al., 

20 1993, J. Neurochem., 61:1207-1219). In contrast to other 
cell recognition molecules that are widely expressed in the 
n@T-vous svstiem. members of the contactin sxibgroup are 




in a more restricted manner on 
during development (Dodd et al., 1985, Neuron, 1:105-116; 
25 Faivre-Sarrailh et al., 1992, J. Neurosci., 12:257-267; 

Yoshihara et al., 1994, Neuron, 13:415-426). This spatial and 
temporal expression pattern suggests that they play a key 
role during axonal guidance and synapse formation. 

Contactin interacts with other members of the Ig 
30 superfamily and with extracellular matrix components. Direct 
interaction was demonstrated between contactin and NgCAH and 
NrCAH, the extracellular matrix proteins tenascin and 
restrictin and with the carbonic anhydrase domain of the 
receptor type tyrosine phosphatase /3 (RPTP/3) (Briimmendorf et 
35 al, 1993, Neuron, 10:711-727; Zisch et al., 1992, J. Cell. 
Biol., 119:203-213; Zisch et al. , 1992, J. Mol. Neurosci.; 
Pesheva et al., 1993, Neuron, 10:69-82; Peles et al., 1995, 
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Cell, 82:251-260). These interactions are mediated by 
different Ig-like domains, the first and second domains bind 
to tenascin and Ng-CAM while the second and third mediate its 
interaction with restrictin (Zisch et al,, 1992, J. Cell. 
5 Biol., 119:203-213; Zisch et al., 1992, J. Mol. Neurosci. ; 
Brtimmendorf et al, 1993, Neuron, 10:711-727.)- Moreover, 
contactin has been shown to be involved in both positive and 
negative responses of neurons to various stimuli (Peles et 
al., 1995, Cell, 82:251-260; Pesheva et al. , 1993, Neuron, 

10 10:69-82). When presented as a ligand to neurons, either as 
a membrane-bound or a soluble form, contactin induces axonal 
growth (Clarke et al., 1993, Eur. J. Cell. Biol., 61:108-115; 
Durbec et al., 1992, J. Cell. Biol., 117:877-887; Gennarini 
et al., 1989, J. Cell. Biol., 109:755-788). Its neuronal 

15 receptor has been identified as the recognition molecule Nr- 
CAM (Morales et al., 1993, Neuron, 11:1113-1122). Contactin 
itself functions as a receptor present on neurons. It 
mediates their repulsion by the extracellular matrix protein 
restrictin and neurite outgrowth induced by the CAR domain of 

20 RPTPiS (Pesheva et al., 1993, Neuron, 10:69-82; Peles et al., 
1995, Cell, 82:251-260) • Thus, depending on the cellular 
context and ligand, contactin can mediate two opposite 
cellular responses (e.g. repulsion vs. adhesion and 
outgrowth) . 

25 The function of cell recognition molecules involves two 

stages, first an adhesion step and then a signal transduction 
step. Signaling by these molecules has been shown to utilize 
different second messenger systems including GTP-binding 
proteins, calcium influx and tyrosine kinases (Reviewed in 
30 Doherty et al., 1994, Curr. Opin. Neurobiol., 4:49-55). Non- 
receptor tyrosine kinases of the src family connect different 
external signals with intracellular signaling pathways. They 
are highly expressed in developing neurons and are enriched 
in the nerve growth cones (Bare et al., 1993, Oncogene, 
35 8:1429-1436; Maness et al., 1994, J. Biol. Chem. , 193:5001- 
5005; Sudol et al. , 1988, Oncogene Res., 2:345-355.). There 
is increasing evidence that links these kinases to signaling 
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pathways that are utilized by neural cell recognition 
molecules. Recently, the potential role for Src and Pyn 
kinases as a downstream component in LI and N-CAM signaling 
was demonstrated using cerebellar neurons from src and fyn- 
5 knockout mice (Beggs et al., 1993, J. Cell Biol. 127:825-833; 
Ignelzi et al., 1994, Neuron, 12:873-884). In addition, 
activation of Fyn by the cell adhesion molecule MAG in 
oligodendrocytes was implicated as a regulatory signaling 
event during myelination (Umemori et al., 1994, Nature, 
10 367:572^576) . Finally, Fyn has been shown to associate with 
contactin in mouse cerebellum and in chick neurons in culture 
(Olive et al., 1995, J. Neurochem. , 65; Zisch et al. , 1992, 
J. Cell. Biol., 119:203-213; Zisch et al., 1992, J. Mol. 



15 The method by which contactin, a GPI-linked protein, 

associates with a cytoplasmic kinases is unclear. One 
possibility is that contactin interacts with a transmembrane 
protein that acts as a ••bridge" to the cell interior. 

The experiments described herein by the Examples of 

20 Section 8 describe the cloning of such candidate molecules 
termed CASPR/pl90 (for Contactin Associated PRotein) . These 
190 kDa proteins are found in a complex with contactin and 
the CAH domain of RPTP/S, but only when both pl90 and RPTP^ 
are present on the same surface of the membrane. The 

25 cytoplasmic tail of CASPR/pl90 contains a proline rich 
sequence that interacts with the SH3 domain of Src family 
kinases. Therefore this molecule could be a potential bridge 
that couples contactin, a GPI-linked protein, with 



30 

5.3 MAMMAI,IAN Pl90 GE KES AND GENE PRODUCTS 

The present invention includes, but is not limited to 
CASPR/pl90 peptides, polypeptides, polypeptide fragments and 
fusion proteins as described herein. The present invention 
35 further includes CASPR/pl90 nucleic acid molecules are 
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In one embodiment, such CASPR/pl90 genes and gene 
products are mammalian, preferably hiiman or rodent, genes and 
gene products. In another embodiment, such genes and gene 
products are naturally occurring genes and gene products* 
5 The purification and sequencing of pl90 protein and the 

cloning of mammalian pl90 cDNA may be conducted as described 
for human and rat pl90 cDNA in the Examples of Section 8. 

The human and rat CASPR/pl90 transcripts have open 
reading frames that encode for 1384 and 1381 amino acids, 
10 respectively, and share 93% identity at the amino acid level. 
CASPR/pl90 is a putative type I transmembrane protein with a 
short proline-rich cytoplasmic domain. (The transmembrane 
domain is marked as TMD in Figure 1) . 

A description of the CASPR/pl90 gene product follows. 
15 Such CASPR/pl90 gene products include, but are not limited to 
gene products containing the amino acid secjuence depicted in 
SEQ ID NOS:2 or 4, or the amino acid sequence of at least one 
of the domains depicted in SEQ ID NOS:2 or 4 and/or as 
depicted in Figure 1 and/ or as described below. 
20 The first CASPR/pl90 methionine is followed by a stretch 

of 19-20 amino acid residues rich in hydrophobic residues, 
which probably acts as a signal sequence. The extracellular 
domains of rat and human CASPR/pl90 contain 1281 and 1282 
amino acid residues, respectively. The extracellular region 
25 of CASPR/P190 contains 16 potential N-linJced glycosylation 
sites followed by a second hydrophobic stretch that is a 
typical transmembrane domain. 

The CASPR/P190 extracellular domain is a novel mosaic of 
several motifs that to mediate protein-protein interactions. 
30 Near the N-terminus of mature CASPR/pl90 (109 amino acid 

residues) is a domain with 31-33% amino acid identity to the 
CI and C2 terminal domains of coagulation factors V and VIII, 
26% identity with the neuronal adhesion molecule neurophilin 
(previously known as the neuronal A5 antigen) and 2 0% 
35 identity to a region of discoidin I, a lectin from the slime 
mold Dictyostelium discoideum. The domain is marked as DISC 
in Figure 1. 
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The extracellular domain of CASPR/pl90 also contains 
four repeats, of approximately 140 amino acid residues each, 
with homology to neurexins, a family of polymorphic neuronal 
cell surface proteins. These domains are marked as NX1-NX4 

5 in Figure 1. 

CASPR/pl90 also contains two epidermal growth factor 
(EGF)-like modules (marked as EGF1-EGF2 in Figxire I), 

A single domain related to the C-terminal region of 
fibrinogen beta/gamma (marked as FIB in Figure 1) is flanked 

10 by an EOF and nexirexin motif. 

CASPR/pl90 contains a stretch of 47 amino acids that is 
identical between hiiman and rat CASPR/pl90, and contains 
seven copies of Pro-Gly-Tyr-Xj.j and three additional imperfect 
repeats of this sequence (marked as PGY in Figure 1) . 

15 The cytoplasmic domain of human and rat CASPR/pl90 

contain 78 and 74 amino acids, respectively. These include a 
38-42 amino acid proline-rich motif (38% proline), the 
majority of which consists of proline residues alternating 
with alanine, glycine, or threonine residues (marked as PRO 

20 in Figure 1) . 

In addition to full length CASPR/pl90 gene products, 
CASPR/P190 polypeptide fragments are also included within the 
scope of the invention. In this sense, the term ••CASPR/pl90 
polypeptide fragments'* encompasses polypeptides that comprise 

25 pl90 fragments, deletions, including internal deletions or 
any combination of pl90 fragments or deletions. In 
particular, pl90 polypeptides are those that specifically 
include or lack any of the domains listed in Table 2, below, 
or any combination thereof. 

30 

^ TABLE 2 



DOMAIN NAME 

DISC 

35 

NXl 



CASPR/P190 AMINO ACID RESIDUES 
AS SHOWN IN FIGURE 1 AND IN SEQ 

ID NOS: 1 or 3 



40- 168 (SEQ ID NO:l) 

41- 169 (SEQ ID NO: 3) 

199- 330 (SEQ ID NO:l) 

200- 331 (SEQ ID NO:3> 
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I* 



10 





362-486 


/ SEO 


TD 


NO: 


1 




363-487 


(SEQ 


ID 


NO: 


3 


ciVvr J. 


544- 


-576 




ID 


NO: 


1 




544-577 


(SEQ 


ID 


NO: 


3 


FIB 


582-739 


(SEQ 


ID 


NO: 


: 1 




583-740 


(SEQ 


ID 


NO: 


;3 


NX 3 


809- 


-938 


(SEQ 


ID 


NO: 


;1 




810- 


-939 


(SEQ 


ID 


NO: 


:3 


EGF2 


961-985 


(SEQ 


ID 


NO: 


:1 




962- 


-986 


(SEQ 


ID 


NO: 


:3 



PGY 



NX4 



TMD 



PRO 



1031- 1077 

1032- 1078 

1083- 1218 

1084- 1219 

1282- 1306 

1283- 1307 

1328- 1369 

1329- 1366 



(SEQ ID NO:l 
(SEQ ID NO: 3 

(SEQ ID NO:l 
(SEQ ID NO: 3 

(SEQ ID NO:l 
(SEQ ID NO: 3 

(SEQ ID NO:l 
(SEQ ID NO: 3 



In a further embodiment of the invention, the pl90 DNA 

or a modified secjuence thereof may be ligated to a 

heterologous sequence to encode a CASPR/pl90 fusion protein. 

For example, for screening peptide libraries it may be useful 

to encode a chimeric pl90 protein expressing a heterologous 

epitope that is recognized by a commercially available 

antibody « A fusion protein may also be engineered to contain 

a cleavage site located between the pl90 secpience and the 
25 ^ . . 

heterologous protein sequence, so that the pl90 protein or 

protein fragment can be cleaved away from the heterologous 

moiety. In another embodiment, DNA sequences encoding a 

fusion protein comprising all or a portion of the pl90 

protein fused to another protein with a desired activity are 

30 

within the scope of the invention; e-g., enzymes such as GUS 
(/3-glucuronidase) , /3-galactosidase, lucif erase, etc. 

With respect to nucleic acid molecules, the invention 
contemplates nucleic acid molecules containing: 1) any DNA 
sequence that encodes the same amino acid sequence as encoded 
by the DNA sequences shown in SEQ ID NO:l and SEQ ID NO: 3; 2) 
any DNA sequence that hybridize to the complement of the 
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coding Bequences disclosed herein under highly stringent 
conditions, e.g., washing in O.lxSSC/0.1% SDS at 68^C 
(Ausubel, et al., eds,, 1989, Current Protocols in Molecular 
Biology, Vol* I, Green Publishing Associates, Inc., and John 
5 Wiley & sons. Inc., New York, at p. 2.10.3; see also 

Sambrook, J. et al., (1989) Molecular cloning, Colo. Spring 
Harbor Press, USA, pp. 9 .47-9 .55), and which can encode a 
functionally equivalent gene product; and/or 3) any DNA 
sequence that hybridizes to the complement of the coding 

10 sequences disclosed therein under less stringent conditions, 
such as moderately stringent conditions, e.g., washing in 
0.2xSSC/0.1% SDS at 42*C (Ausubel, et al., 1989, supraj 
sambrook, et al., 1989, supra), yet which encodes a 
functionally equivalent gene product. 

15 As used herein, the term « functionally equivalent gene 

product" refers to a gene product that exhibits at least one 
of the biological functions of the gene product depicted in 
SEQ ID NOS: 2 and/or 4. Such biological functions can 
include, but are not limited to, a function (e.g. , a protein- 

20 protein interaction function) as exhibited by at least one of 
the domains of the SEQ ID NO: 2 or 4 gene products. 

In another embodiment, DNAs that encode mutant forms of 
pl90 are also included within the scope of the invention. 
Such mutant pl90 DNA sequences encompass deletions, additions 

25 and/or substitutions of nucleotide residues, or of regions 
coding for domains within the pl90 protein. These mutated 
pl90 DNAs may encode gene products that are functionally 
equivalent or which display properties very different from 
the native forms of pl90. 

30 The invention also encompasses 1) DNA vectors that 

contain any of the coding sequences disclosed herein (see 
SEQ ID NO:l and SEQ ID NO: 3), and/or their complements (i.e., 
antisense) ; 2) DNA expression vectors that contain any of the 
coding sequences disclosed therein, and/or their complements 

35 (I.e., antisense), operatively associated with a regulatory 
element that directs the expression of the coding and/ or 
antisense sequences; and 3) genetically engineered host cells 
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that contain any of the coding sequences disclosed therein, 
and/or their complements (i.e., antisense) , operatively 
associated with a regulatory element that directs the 
expression of the coding and/ or antisense sequences in the 
5 host cell. Regulatory element includes but is not limited to 
inducible and non- inducible promoters, enhancers, operators 
and other elements known to those skilled in the art that 
drive and regulate expression* The invention includes 
fragments of any of the DNA secfuences disclosed herein. 
10 pl90 secpaence can be obtained from a variety of sources 

including cDNA libraries. For example, appropriate cDKA 
libraries which are good sources of pl90 can be obtained from 
(Clonetech (Palo Alto, CA) , Stratagene (La Jolla, CA) the 
ATCC Repository (Rockville, MD) . In addition, cDNA libraries 
X5 may be prepared from mRNA pools collected from mammalian 

cells which express pl90 either constitutively or inducibly. 
By way of example but not by way of limitation, such cells 
include rat GH3 cells, as well as CHO, VERO, BHK, HeLa, COS, 
MDWCK, -293, WI38, etc. The collection of mRNA pools and 
20 construction of cDNA libraries from these cells are set forth 
more fully in the examples described infra. 

Any of the cDNA libraries described above may be 
screened by hybridization or PGR using the pl90 sequences 
described herein as oligonucleotide probes. Screening can be 
2 5 performed using those portions of the pl90 sequence as 
discussed in the Examples of Section 8, Infra* 

In addition to cDNA libraries, partial pl90 secpience can 
be obtained from any genomic library by library screening or 
from genomic DNA by PGR. Full cDNA sequences can be obtained 
30 by PGR of total RNA isolated from any cell or tissue that 
expresses pl90 including, but not limited to, neuronal 
tissue. Cellular sources also include, but are not limited 
to, hematopoietic, fetal, and embryonal tissues. 

Alternatively, the cDNA libraries described above can be 
35 used to construct expression libraries in a cell line such as 
CHO, VERO, BHK, HeLa, COS, MDWCK, -293, WI38, etc., or other 
cells known in the art to contain little or no autologous 
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pl90 activity. These expression libraries can then be 
screened using antibodies which are specific to pl90. 
Expression libraries for antibody screening may also be made 
in bacteria, such as E. coli , using phage vectors, such as 
5 lambda. These expression libraries may also be screened for 
pl90 enzyme activity as set forth in the examples which are 
described in more detail infra. 

Other isoforms of pl90 may exist and may be cloned using 

the pl90 gene sequence. 

10 

5.4 EXPRESSING THE Pl90 GEK E PRODUCT 

In order to express a biologically active pl90, the 
coding sequence for the enzyme, a function equivalent, or a 
modified sequence, as, e.g. . described in Section 5.3,, 

15 supra, is inserted into an appropriate eukaryotic expression 
vector, i.e., a vector which contains the necessary elements 
for transcription and translation of the inserted coding 
sequence in appropriate eukaryotic host cells which possess 
the cellular machinery and elements for the proper 

20 processing, i.e., signal cleavage, glycosylation, 

phosphorylation, sialylation, and protein sorting. Mammalian 
host cell expression systems are preferred for the expression 
of biologically active enzymes that are properly folded and 
processed. When administered in humans such expression 

25 products may also exhibit tissue targeting. 

The invention also encompasses peptide fragments of the 
pl90 gene product. The pl90 gene product or peptide 
fragments thereof, can be linked to a heterologous peptide or 
protein as a fusion protein. In addition, chimeric pl90 

30 expressing a heterologous epitope that is recognized by a 
commercially available antibody is also included in the 
invention. A durable fusion protein may also be engineered; 
i.e., a fusion protein which has a cleavage site located 
between the pl90 sequence and the heterologous protein 

35 sequence, so that the pl90 gene product, or fragment thereof, 
can be cleaved away from the heterologous moiety. For 
example, a collagenase cleavage recognition consensus 
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sequence may be engineered between the pl90 gene product, or 
fragment thereof, the heterologous peptide or protein. The 
pl90 domain can be released from this fusion protein by 
treatment with collagenase. 

5.4.1 COM8TRUCTZON OF EXPRESSION VECTORS 

AND PREPARATIOK OP TRANSFECTAMTS 



Methods which are well-known to those skilled in the art 
can be used to construct expression vectors containing the 
pl90 coding sequence and appropriate transcriptional/ 
translational control signals. These methods include in 
vitro recombination/genetic recombination. See, for example, 
the techniques described in Sambook et al., 1987, Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, 

N . Y . , Chapter 12 . 

pl90 proteins produced by these methods would be useful 
for in vitro studies on the mechanism of action of pl90 and 
particularly for further studies on the mechanism of action 
of any inhibitors that are selective for pl90 that are 
identified by drug screening with the stably expressing pl90 
cell lines, as infra, or for investigating the mechanism of 
action of existing drugs or of inhibitors that may be 
identified by other means. The purified pl90 proteins would 
also be useful for the production of crystals suitable for X- 
ray crystallography. Such crystals would be extremely 
beneficial for the rational design of drugs based on 
molecular structure. Expression of these chimeric DNA 
constructs in a baculovirus or yeast system and subsequent 
crystallization of the proteins would yield such data. 

A variety of eukaryotic host-expression systems may be 

30 . 
used to express the pl90 coding sequence. Although 

prokaryotic systems offer the distinct advantage of ease of 

manipulation and low cost of scale-up, their major drawback 

in the expression of pl90 is their lack of proper post- 

translational modifications of expressed mammalian proteins. 

Eukaryotic systems, and preferably mammalian expression 

systems, allow for proper modification to occur. Eukaryotic 
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cells which possess the cellular machinery for proper 
processing of the primary transcript glycosylation, 
phosphorylation, and, advantageously secretion of the gene 
product should be used as host cells for the expression of 
5 pl90. Mammalian cell lines are preferred. Such host cell 
lines may include but are not limited to CHO, VERO, BHK, 
HeDa, COS, MDWCK, -293, WI38, etc. Alternatively, eukaryotic 
host cells which possess some but not all of the cellular 
machinery rec[uired for optional processing of the primary 

10 transcript and/or post-trans lational processing and/or 

secretion of the gene product may be modified to enhance the 
host cell's processing capabilities* For example, a 
recombinant nucleotide sequence encoding a peptide product 
that performs a processing function the host cell had not 

15 previously been capable of performing, may be engineered into 
the host cell line. Such a sequence may either be co-trans- 
fected into the host cell along with the gene of interest, or 
included in the recombinant construct encoding the gene of 
interest. Alternatively, cell lines containing this sequence 

20 may be produced which are then transfected with the gene of 
interest . 

Appropriate exiJcaryotic expression vectors should be 
utilized to direct the expression of pl90 in the host cell 
chosen. For example, at least two basic approaches may be 

25 followed for the design of vectors based on SV40. The first 
is to replace the SV40 early region with the gene of interest 
while the second is to replace the late region (Hammarskjold, 
et al., 1986, Gene, 43:41-50. Early and late region 
replacement vectors can also be complemented in vitro by the 

30 appropriate SV40 mutant lacking the early or late region. 
Such complementation will produce recombinants which are 
packaged into infectious capsids and which contain the pl90 
gene. A permissive cell line can then be infected to produce 
the recombinant protein. SV40-based vectors can also be used 

35 in transient expression studies, where best results are ob- 
tained when they are introduced into COS (CV-l, origin of 
SV40) cells, a derivative of CV-l (green monkey kidney cells) 
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which contain a single copy of an origin defective SV40 
genome integrated into the chromosome. These cells actively 
synthesize large T antigen (SV40) , thus initiating 
replication from any plasmid containing an SV40 origin of 
5 replication. 

In addition to SV40, almost every molecular ly cloned 
virus or retrovirus may be used as a cloning or expression 
vehicle. Viral vectors based on a number of retroviruses 
(avian and murine) , adenoviruses, vaccinia virus (Cochran, et 
10 al., 1985, Proc. Natl. Acad. Sci. USA, 82:19-23) and polyoma 
virus may be used for expression. Other cloned viruses, such 
as J C (Howley, et al., 1980, J. Virol, 36:878-882), BK and 
the human papilloma viruses (Heilmsan, et al., 1980, J. 
Virol, 36:395-407), offer the potential of being used as 
15 eukaryotic expression vectors. For example, when using 

adenovirus expression vectors, the pl90 coding sequence may 
be ligated to an adenovirus transcription/ translation control 
complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted in the 
20 adenovirus genome by in vitro or in vivo recombination. 
Insertion in a non-essential region of the viral genome 
(e.g., region El or E3) will result in a recombinant virus 
that is viable and capable of expressing the htunan enzyme in 
infected hosts (e.g., see Logan & Shenk, 1984, Proc. Natl. 
25 Acad. Sci. USA, 81:3655-3659). Alternatively, the vaccinia 
virus 7.5K promoter may be used, (e.g., see, Hackett et al., 
1982, Proc. Natl. Acad. Sci. USA, 79:7415-7419; Hackett et 
al., 1994, J. Virol. 49:857-864, Panicali et al., 1982, Proc. 
Natl. Acad. Sci. USA, 79:4927-4931). Of particular interest 
30 are vectors based on bovine papilloma virus (Sarver, et al., 
1981, Mol. Cell. Biol., 1:486-496), or Semliki Forest Virus, 
which provides large quantities of active protein in induced 
cells (Olkkohnen et al. , 1994, Meth. Cell. Biol., 43 part 
A:43-53; Lundstrum et al., 1994, Eur. J. Biochem., 224:917- 
35 921) . These vectors have the ability to replicate as 

extrachromosomal elements. Shortly after entry of this DNA 
into mouse cells, the plasmid replicates to about lOO to 200 
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copies per cell. Transcription of the inserted cDNA does not 
require integration of the plasmid into the host's 
chromosome, thereby yielding a high level of expression- 
These vectors can be used for stable expression by including 
5 a selectable marker in the plasmid, such as the neo gene. 
High level expression may also be achieved using inducible 
promoters such as the metallothionine II A promoter, heat 

shock promoters, etc. 

For long-term, high^^yield production of recombinant 

10 proteins, stable expression is preferred. For example, 

following the introduction of foreign DNA, engineered cells 
may be allowed to grow for 1-2 days an enriched media, and 
then are switched to a selective media. Rather than using 
expression vectors which contain viral origins of 

15 replication, host cells can be transformed with the pl90 DNA 
controlled by appropriate expression control elements (e.g., 
promoter, enhancer, sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. The 
selectable marker in the recombinant plasmid confers 

20 resistance to the selection and allows cells to stably 

integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell 
lines. A number of selection systems may be used, including 
but not limited to the herpes simplex virus thymidine kinase 

25 (Wigler, etal., 1977, Cell, 11:223-232), hypoxanthine- 
guanine phosphoribosylatransf erase (Szybalska & Szybalski, 
1962, Proc. Natl. Acad. Sci. USA, 48:2026), and adenine 
phosphoribosyltransf erase (Lowy, et al., 1980, Cell, 22:817- 
823) genes can be employed in tk", hgprf or aprt" cells 

30 respectively. Also, antimetabolite resistance can be used as 
the basis of selection for dhfr, which confers resistance to 
methotrexate (Wigler, et al. , 1980, Natl. Acad. Sci. USA 
77:3567-3570; O'Hare, et al., 1981, Proc. Natl. Acad. Sci. 
USA 78:1527-1531); ygpt, which confers resistance to 

35 mycophenolic acid (Mulligan £e Berg, 1981, Proc. Natl. Acad. 
Sci. USA, 78:2072-2076) ; neo, which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. 
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Biol., 150:1-14); and hygro, which confers resistance to 
hygromyciin (Santerre, et al., 1994, Gene, 30:147-156) genes- 
Recently, additional selectable genes have been described, 
namely trpB, which allows cells to utilize indole in place of 
5 tryptophan; hisD, which allows cells to utilize histinol in 
place of histidine (Hartman & Mulligan, 1988, Proc. Natl* 
Acad. Sci. USA, 85:8047-8051), and ODC (ornithine 
decarboxylase) which confers resistance to the ornithine 
decarboxylase inhibitor, 2-(dif luromethyl) -DL-ornithine, DFMO 
10 (HcConlogue L. , 1987, In: Current Communications in Molecular 
Biology, Cold Spring Harbor Laboratory ed.). 

Alternative eukaryotic expression systems which may be 
used to express the pl90 enzymes are yeast transformed with 
recombinant yeast expression vectors containing the pl90 
X5 coding secpience; insect cell system infected with recombinant 
virus expression vectors (e.g., baculovirus) containing the 
pl90 coding sequence; or plant cell systems infected with 
recombinant virus expression vectors (e.g., cauliflower 
mosaic virus, CaMV; tobacco mosaic, TMV) or transformed with 
20 recombinant plasmid expression vectors (e.g., Ti plasmid) 
containing the pl90 coding sequence. 

In yeast, a number of vectors containing constitutive or 
inducible promoters may be used. For a review see, Cxxrrent 
Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et 
25 al., Greene Publish. Assoc. & Wiley Interscience, Ch. 13; 
Grant et al., 1987, Expression and Secretion Vectors for 
Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 31987, 
Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA 
Cloning, Vol. II, IRL Press, Wash., D.C., Ch. 3; Bitter, 
30 1987, Heterologous Gene Expression in Yeast, Methods in 

Enzymology, Eds. Berger & Kimmel Acad. Press, N.Y., Vol. 152, 
pp. 673-694; and The Molecular Biology of the Yeast 
Saccharomyces , 1982, Eds. Strathem et al.. Cold Spring Harbor 
Press, Vols. I and II- For complementation assays in yeast, 
35 cDNAs for pl90 may be cloned into yeast episomal plasmids 
(YEp) which replicate autonomously in yeast due to the 
presence of the yeast 2m circle. The cDNA may be cloned 
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behind either a constitutive yeast promoter such as ADH or 
I*EU2 or an Inducible promoter such as GAL (Cloning In Yeast, 
Chpt. 3, R. Rothsteln In: DNA Cloning Vol. li, A Practical 
Approach, Ed. DM Glover, 1986, IRL Press, Wash., D.C.). 
5 Constructs may contain the 5» and 3' non-translated regions 
of the cognate pl90 mRNA or those corresponding to a yeast 
gene. YEp plasmlds transform at high efficiency and the 
plasmlds are extremely stable. Alternatively, vectors may be 
used which promote Integration of foreign DNA sequences Into 

10 the yeast chromosome. 

Alternately, active, post-translatlonally modified pl90 
proteins can be obtained using a yeast expression system such 
as the Pichla pastor is expression system marketed by 
Invitrogen {Plchia pastoris is o%med and licensed by Research 

IS corporation Technologies, Tucson, AZ; however, all components 
are available from Invitrogen, San Diego, CA) . In this 
example, cDNAs encoding himan pl90 are independently cloned 
into the pHII>-D2 Plchist expression vector- After 
linearization with a restriction endonuclease , these 

20 constructs are transfected into spheroblasts of the his4 

Plchia pastoris strain, GS115, and recombinant yeast carrying 
the cloned pl90 DNA sequences are identified by screening for 
yeast clones that grow in the absence of histidine (now 
supplied by the recombinant vector) , but do not efficiently 

25 utilize methanol as the sole carbon source (due to the 

presence of pl90 in the place of AOXI gene sequence coding 
for methanol utilization) . After expansion of such clones in 
the presence of an alternative carbon source such as 
glycerol, large quantities of cells would be transferred to 

30 liquid media containing methanol where replication ceases. 
However, cells remain viable for many days during which time 
human pl90 proteins are specifically expressed at high levels 
under control of the AOXI promoter. The advantages of this 
system include very high protein yields and lower expense in 
35 the production and maintenance of cultures. 

In cases where plant expression vectors are used, the 
expression of the pl90 coding sequence may be driven by any 
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Of a number of promoters. For example, viral promoters such 
as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al., 
1984, Nature, 310:511-514), or the coat protein promoter of 
THV (Takamatsu et al. , 1987, EMBO J- , 6:307-311) may be used; 
5 alternatively, plant promoters such as the small subunit of 
RUBISCO (Comzzi et al-, 1994, EMBO J., 3:1671-1680; Broglie 
et al-, 1984, Science, 224:838-843); or heat shock promoters, 
eg., soybean hsp 17.5-E or hsp 17-3-B (Gurley et al-, 1986, 
Mol. Cell- Biol., 6:559-565) may be used. These constructs 
10 can be introduced into plant cells using Ti plasmids, Ri 
plasmids, plant virus vectors; direct DNA transformation; 
microinjection, electroporation, etc- For reviews of such 
techniques see, for example, Weissbach & Weissbach, 1988, 
Methods for Plant Molecular Biology, Academic Press, NY, 
15 Section VIII, pp. 421-463; and Grierson & Corey, 1988, Plant 
Molecular Biology, 2d Ed-, Blackie, London, Ch. 7-9. 

An alternative expression system which could be used to 
express pl90 is an insect system. In one such system, 
Aata^retpha californica nuclear polyhedrosis virus (AcNPV) is 
20 used as a vector to express foreign genes. The virus grows 
in SpodoptBra frugiperda cells. The pl90 sequence may be 
cloned into non-essential regions (for example the polyhedrin 
gene) of the virus and placed under control of an AcNPV 
promoter (for example the polyhedrin promoter) . Successful 
25 insertion of the coding sequence will result in inactivation 
of the polyhedrin gene and production of non-occluded re- 
combinant virus (i-e., virus lacking the proteinaceous coat 
coded for by the polyhedrin gene) . These recombinant viruses 
are then used to infect SpodoptBra fruglperda cells in which 
30 the inserted gene is expressed, (e.g., see Smith et al., 
1983, J- Virol., 46:584, Smith, U.S. Pat. No. 4,215,051). 

In a specific embodiment of an insect system, the DNA 
encoding pl90 can be independently cloned into the 
pBlueBacIlI recombinant transfer vector (Invitrogen, San 
35 Diego, CA) downstream of the polyhedrin promoter and 

transfected into Sf9 insect cells (derived from SpodoptBra 
frugipBrda ovarian cells, available from Invitrogen, San 
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Diego, CA) to generate recombinant virus containing pl90. 
After plaque purification of the recombinant virus high-titer 
viral stocks are prepared that in turn would be used to 
infect Sf9 or High Five'" (BTI-TN-5B1-4 cells derived from 
5 Trlchoplusia ni egg cell homogenates; available from 

Invitrogen, San Diego, CA) insect cells, to produce large 
quantities of appropriately post-trans lationally modified 
pl90 proteins. Although it is possible that these cells 
themselves could be directly useful for drug assays, the pl90 
XO proteins prepared by this method can be used for in vitro 
assays of drug potency and selectivity. 

5.4.2 IDEHTIFICATION OP TRAH8FBCTANT8 
OR TRAN8FORMAKT8 EZPRE88IN6 
THE P190 QEKE PRODUCT 

The host cells which contain the pl90 coding sequence 
and which express the biologically active gene product may be 
identified by at least four general approaches: (a) DNA-DNA 
or DNA-RNA hybridization; (b) the presence or absence of 
"marker" gene functions; (c) assessing the level of 
*® transcription as measured by the expression of pl90 mRNA 

transcripts in the host cell; and (d) detection of the gene 
product as measured by immunoassay or by its biological 
activity. 

In the first approach, the presence of the pl90 coding 
sequence inserted in the expression vector can be detected by 
DNA-DNA or DNA-RNA hybridization or PGR using probes 
comprising nucleotide sequences that are homologous the pl90 
coding sequence or portions or derivatives thereof. 

In the second approach, the recombinant expression 
vector /host system can be identified and selected based upon 
the presence or absence of certain "marker" gene functions 
(e.g., resistance to antibiotics, resistance to methotrexate, 
transformation phenotype, occlusion body foirmation in 
baculovirus, etc.). For example, if the pl90 coding sequence 
35 within a marker gene sequence of the vector, recombinants 
containing the pl90 coding sequence can be identified by the 
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absence of the marker gene function. Alternatively, a marker 
gene can be placed in tandem with the pl90 sequence under the 
control of the same or different promoter used to control the 
expression of the pl90 coding sequence. Expression of the 
5 marker in response to induction or selection indicates 
expression of the pl90 coding sequence. In addition, the 
marker gene may be identified by DNA-DNA or DNA-RNA 
hybridization or PGR. 

In the third approach, transcriptional activity for the 
10 pl90 coding region can be assessed by hybridization or PGR 
assays. For example, RNA can be isolated and analyzed by 
Northern blot using a probe homologous to the pl90 coding 
sequence or particular portions thereof. Alternatively, 
total nucleic acids of the host cell may be extracted and 
15 assayed for hybridization to such probes. 

In the fourth approach, the expression of the pl90 
protein product can be assessed immunologically, for example 
by Western blots, immunoassays such as radioimmuno- 
precipitation, enzyme-linked immunoassays and the like. The 
20 ultimate test of the success of the expression system, 

however, involves the detection of the biologically active 
pl90 gene product. Where the host cell secretes the gene 
product, the cell free media obtained from the cultured 
transfectant host cell may be assayed for pl90 activity. 
25 Where the gene product is not secreted, cell lysates may be 
assayed for such activity. In either case, a number of 
assays can be used to detect pl90 activity, including but not 
limited to, those described in the examples infra or those 
known in the art. 

30 

5.4.3 GEI,!. IiIKBS EXPRg ggTMC Pl90 

The present invention also relates to cell lines 
containing recombinant DNA sequence, preferably a 
chromosomally integrated recombinant DNA sequence, which 
35 comprises the gene encoding pl90 which cell lines further do 
not express autologous pl90, apart from that encoded by the 
recombinant DNA sequence. 
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A specific embodiment of the present invention is an 
engineered mammalian cell line which contains a chromosomally 
integrated, genetically-engineered ("recombinant") DNA 
sequence, which DNA sequence expresses mammalian pl90, and 
5 wherein said cell line also does not express autologous pl90. 
The cell line is preferably of hiiman or primate origin, such 
as the exemplified monkey kidney COS cell line, but cell 
lines derived from other species may be employed, including 
chicken, hamster, murine, ovine and the like; the CHO 

* 

10 (Chinese hamster ovary) cell line for example, may be 
particularly preferred for large scale production. 

Any cell or cell line, the genotype of which has been 
altered by the presence of a recombinant DNA sequence is 
encompassed by the invention. The recombinant DNA sequence 

15 may also be referred to herein as "heterologous DNA," 

"exogenous DNA," "genetically engineered" or "foreign DNA," 
indicating that the DNA was introduced into the genotype or 
genome of the cell or cell line by a process of genetic 
engineering . 

20 The invention includes, but is not limited to, a cell or 

cell line wherein the native pl90 DNA sequence has been 
removed or replaced as a result of interaction with a 
recombinant DNA sequence. Such cells are called pl90 
knockouts, herein, if the resulting cell is left without a 

25 native DNA that encodes a functional pl90 gene product. 

As used herein, the term "recombinant DNA sequence" 
refers to a DNA sequence that has been derived or isolated 
from any source, that may be subsequently chemically altered, 
and later introduced into mammalian cells. An example of a 

30 recombinant DNA sequence "derived" from a source, would be a 
DNA sequence that is identified as a useful fragment within a 
given organism, and which is then chemically synthesized in 
essentially pure form. An example of such DNA sequence 
"isolated" from a source would be a DNA sequence that is 

35 excised or removed from said source by chemical means, e.g., 
by the use of restriction endonucleases , so that it can be 
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further manipulated^ e.g., amplified, for use in the 
invention, by the methodology of genetic engineering. 

Therefore, "recombinant DNA sequence" includes 
completely synthetic DNA, semi-synthetic DNA, DNA isolated 
5 from biological sources, and DNA derived from introduced RNA. 
Generally, the recombinant DNA sequence is not originally 
resident in the genotype which is the recipient of the DNA 
sequence, or it is resident in the genotype but is not 
expressed. 

10 The isolated recombinant DNA sequence used for 

transformation herein may be circular or linear, double- 
stranded or single-stranded. Generally, the DNA secjuence is 
chimeric linear DNA, or is a plasmid or viral expression 
vector, that can also contain coding regions flanked by 
15 regulatory sequences which promote the expression of the 
recombinant DNA present in the resultant cell line. For 
example, the recombinant DNA sequence may itself comprise or 
consist of a promoter that is active in mammalian cells, or 
may utilize a promoter already present in the genotype that 
20 is the transformation target. Such promoters include, but 

are not limited to, the CMV promoter, SV 40 late promoter and 
retroviral LTRs (long terminal repeat elements) . 

The general methods for constructing recombinant DNA 
which can transform target cells are well known to those 
25 skilled in the art, and the same compositions and methods of 
construction may be utilized to produce the DNA useful 
herein. For example, J. Sambrook et al. , Molecular Cloning; 
A liaboratory Manual, Cold Spring Harbor Laboratory Press (2d 
ed., 1989), provides suitable methods of construction. 
30 Aside from recombinant DNA sequence that serve as 

transcription units for pl90 or other portions thereof, a 
portion of the recombinant DNA may be untranscribed, serving 
a regulatory or a structural function. 

The recombinant DNA sequence to be introduced into the 
35 cells further will generally contain either a selectable 
marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells. 
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Alternatively, the selectable marker may be carried on a 
separate piece of DNA and used in a co-transformation 
procedure. Both selectable markers and reporter genes may be 
flanked with appropriate regulatory sequences to enable 
5 expression in mammalian cells. Useful selectable markers are 
well known in the art and include, for example, anti-biotic 
and herbicide resistance genes. 

Sources of DNA sequences useful in the present invention 
include Poly-A RNA from mammalian cells, from which the mRNA 
10 encoding pl90 can be derived and used for the synthesis of 
the corresponding cDNA by methods known to the art. Such 
sources include cDNA libraries and mRNA pools made from 
neuronal, neuroblastoma, embryonic, fetal, and hematopoietic 
tissues of human, rat or other mammalian origin. 

Selectable marker genes encoding enzymes which impart 
resistance to biocidal compounds are listed in Table 1, 
below. 

Table 3 



20 



25 



Resistance 
Gene or Enzvme 

Neomycin phospho- 
transferase (neo) 



Hygromycin 
phosphotrans- 
ferase (hpt or 
hyg) 

Dihydrof olate 
reductase (dhfr) 



Resistan ce to; 

G-418, neomycin, 
kanamycin 



Hygromycin B 



Reference 



30 



Phosph inothr ic in Phosph inot hr ic in 



(bar) 

2 , 2-Dichloropro 
35 Pi^^^fc 



2-2 , Dichloropro- 
pionic acid 
(Dalapon) 



al., 1982, J. Mol. 
Appl. Gen., 1:327-341 

Shimizu et al.^ 1986, 
Mol. Cell Biol. , 
6: 1074-1087 



Kwok et al. , 1986, 
Proc . Nat • 1 . Acad . 

Sci. USA, 4552-4555 

DeBlock et al., 1987, 
BO J. , 6:2513-2 



Mil 



Buchanan-Wollaston 
al., 1989, J. 
Biochem. , Supp 
330 



13D, 
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Acetohydr oxyac id 



Sulfonylurea , 
imidazolinone and 
triazolopyrimidine 



5 - Eno Ipy r u vy 1 
shiklina'tephos 



Glyphosate 



(aroA) 



Haloarylnitrilase Bromoxynil 



10 



Acetyl-coenzyme A 



Dihydropteroate 
15 synthase (sul I) 



Sethoxydim, 
haloxyf op 



Sulfonamide 



Anderson et al. (U«S 
Patent No. 

4,761,373) ; G.W. 
Haughn et al., 1988 
Mol. Gen. Genet., 
211:266-271 

Comai et al., 1985 
Nature , 317:741-744 



Stalker et al., 
published PCT appln 
W087/04181 

Parker et al., 1990 
Plant Physiol., 
92:1220 



et al. , 
1990, Plant Molec 

Biol. , 15:127-136 



32 kD photosystem Triazine herbicides Hirschberg 



20 



(psbA) 

Anthr an i 1 ate 
synthase 

D ihy dr od i p i co 1 i n* 
ic acid synthase 
(dap A) 



et al. , 
1983, Science, 
222:1346-1349 



5-Methyltryptophan Hibberd et al. (U.S. 

Patent No. 4,581,847) 



Aminoethyl cyst 



eine Glassman et al* 
published PCT 
application No. 
W089/11789 



25 



transformed cells and for 



regulatory sequences 



30 



i a 

expressed by 
encodes a 
easily 



used for identifying potentially 
evaluating the functionality of 
Reporter genes which encode for easily 
are well known in the art. In 

a gene which is not present in or 
organism or tissue and which 



35 



whose expression is manifested by some 
property, e.g., enzymatic activity, 
includes the chloramphenicol acetyl 
(cat) from Tn9 of E. coll, the beta- 
gene of E. coli, the beta-glucuronidase gene 
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(gus) of tOie uldA locus of E. coli, and the lucif erase gene 
from firefly Photinus pyralis. Expression of the reporter 
gene is assayed at a suitable time after the DNA has been 
introduced into the recipient cells. 
5 Other elements such as introns, enhancers, 

polyadenylation sequences and the like, may also be a part of 
the recombinant DNA sequence. Such elements may or may not 
be necessary for the function of the DNA, but may provide 
improved expression of the DNA by affecting transcription, 

10 stability of the mRNA, or the like. Such elements may be 
included in the DNA as desired to obtain the optimal 
performance of the transforming DNA in the cell. 

The recombinant DNA sequence can be readily introduced 
into the target cells by trans feet ion with an expression 

15 vector, such as a viral expression vector, comprising cDNA 
encoding pl90 by the modified calciiim phosphate precipitation 
procedure of Chen et al. , 1987, Mol. Cell. Biol., 7:2745- 
2752. Transfection can also be accomplished by other 
methods, including lipofection, using commercially available 

20 kits, e.g., provided by Life Technologies. 

In a preferred embodiment of the invention, the cell 
lines of the invention are able to express a stable pl90 gene 
product or analog, homologue, or deletion thereof after 
several passages through cell culture. 

25 

5.4 .4 PURIFICATION OF THE pl90 OENE 

PRODUCT 



Once a cell that produces high levels of biologically 
active pl90 is identified, the cell may be clonally expanded 

30 and used to produce large quantities of the enzyme, which may 
be purified using techniques well-known in the art including, 
but not limited to, immunoaf f inity purification, 
chromatographic methods including high perfoirmance liquid 
chromatography and the like. Where the enzyme is secreted by 

35 the cultured cells, pl90 may be readily recovered from the 
culture medium. 
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Where the pl90 coding sequence, or fragment thereof, has 
been engineered to encode a cleavable fusion protein, the 
purification of the pl90 gene product, or fragment thereof, 
may be readily accomplished using affinity purification 
5 technicjues. For example, an antibody specific for the 
heterologous peptide or protein can be used to capture the 
dtirable fusion protein; for example, on a solid surface, a 
column etc. The pl90 moiety can be released by treatment 
with the appropriate enzyme that cleaves the linkage site. 

10 cDNA construction using the polymerase chain reaction 

accompanied by transfection and purification of the expressed 
protein permits the isolation of sufficient quantities of 
pl90 for characterization of the enzyme's physical and 
kinetic properties. Using site-directed mutagenesis or 

15 naturally occurring mutant sequences, this system provides a 
reasonable approach to determine the effects of the altered 
primary structure on the function of the protein. Fusion 
constructs of the pl90 protein domain with the marker peptide 
preceding the amino terminus of pl90 or following the carboxy 

20 terminus of pl90 may also be engineered to evaluate which 
fusion construct will interfere the least, if at all, with 
the protein's biologic function and the ability to be 



Using this aspect of the invention, any cleavage site or 
25 enzyme cleavage substrate may be engineered between the pl90 
sequence and a second peptide or protein that has a binding 
partner which could be used for purification, e.g, any 
antigen for which an immunoaf f inity column can be prepared. 

30 5.5 I^IBODIES TO TH E nl90 GENE PRQDiyCT 

For the production of antibodies, various host animals 
may be immunized by injection with the pl90 gene product, or 
a portion thereof including, but not limited to, portions of 
the pl90 gene product in a recombinant protein. Such host 
35 animals may include but are not limited to rabbits, mice, and 
rats, to name but a few. Various adjuvants may be used to 
increase the immunological response, depending on the host 
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species, including but not limited to Freund's (complete and 
incomplete) , mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet 
5 hemocyanin, dinitrophenol, and potentially useful human 
adjuvants such as BCG (bacille Calmette-Guerin) and 
CorynebactBrium parvum. 

Monoclonal antibodies may be prepared by using any 
technique which provides for the production of antibody 

10 molecules by continuous cell lines in culture. These include 
but are not limited to the hybridoma technique originally 
described by Kohler and Milstein, 1975, Nature, 256:495-497, 
the hximan B-cell hybridoma technique (Kosbor et al., 1983, 
Immunology Today, 4:72, Cote et al. , 1983, Proc* Natl. Acad. 

15 Sci-, 80:2026-2030) and the EBV-hybridoma technique (Cole et 
al., 1985, Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96). In addition, techniques developed 
for the production of "chimeric antibodies" (Morrison et al., 
1984, Proc. Natl. Acad. Sci., 81:6851-6855; Neuberger et al., 

20 1984, Nature, 312:604-608; Takeda et al. , 1985, Nature, 
314:452-454) by splicing the genes from a mouse antibody 
molecule of appropriate antigen specificity together with 
genes from a human antibody molecule of appropriate 
biological activity can be used. Alternatively, techniques 

25 described for the production of single chain antibodies (U.S. 
Patent No. 4,946,778) can be adapted to produce single chain 
antibodies specific to one of the binding partners. 

Antibody fragments which recognize specific epitopes may 
be generated by known techniques. For example, such 

30 fragments include but are not limited to: the F(ab«)2 

fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(abM2 
fragments. Alternatively, Fab expression libraries may be 

35 constructed (Huse et al. , 1989, Science, 246:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 
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THERAPIES 



A variety of gene therapy approaches may be used in 
accordance with the invention to modulate expression of the 
pl90 gene In vivo. For example, antisense DNA molecules may 

5 be engineered and used to block translation of pl90 mRNA in 
vivo. Alternatively, ribozyme molecules may be designed to 
cleave and destroy the pl90 mRNAs In vivo. In another 
alternative, oligonucleotides designed to hybridize to the 5» 
region of the pl90 gene (including the region upstream of the 

10 coding sequence) and form triple helix structures may be used 
to block or reduce transcription of the pl90 gene. In yet 
another alternative, nucleic acid encoding the full length 
wild-type pl90 message may be introduced In vivo into cells 
which otherwise would be unable to produce the wild-type pl90 

15 gene product in sufficient quantities or at all- 
in a preferred embodiment, the antisense, ribozyme and 
triple helix nucleotides are designed to inhibit the 
translation or transcription of pl90. To accomplish this, 
the oligonucleotides used should be designed on the basis of 

20 relevant sequences unique to pl90. For example, and not by 
way of limitation, the oligonucleotides should not fall 
within those regions where the nucleotide sequence of pl90 is 
most homologous to that of other known proteins. 

Instead, it is preferred that the oligonucleotides fall 

25 within the regions of pl90, which diverge from the sequence 

of other known proteins. 

In the case of antisense molecules, it is preferred that 
the sequence be chosen from those divergent sequences just 
mentioned above. It is also preferred that the sequence be 
30 at least 18 nucleotides in length in order to achieve 

sufficiently strong annealing to the target mRNA sequence to 
prevent translation of the sequence. Izant and Weintraub, 
1984, cell, 36:1007-1015; Rosenberg et al. , 1985, Nature, 
313:703-706, 

In the case of the "hammerhead" type of ribozymes, it is 
also preferred that the target sequences of the ribozymes 
chosen from the above-mentioned divergent sequences. 
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Ribozymes are RNA molecules which possess highly specific 
endoribonuclease activity. Hammerhead ribozymes comprise a 
hybridizing region which is complementary in nucleotide 
sequence to at least part of the target RNA, and a catalytic 
5 region which is adapted to cleave the target RNA. The 
hybridizing region contains nine (9) or more nucleotides. 
Therefore, the hammerhead ribozymes of the present invention 
have a hybridizing region which is complementary to the 
sequences listed above and is at least nine nucleotides in 

10 length. The construction and production of such ribozymes is 
well known in the art and is described more fully in Haseloff 
and Gerlach, 1988, Nature, 334:585-591. 

The ribozymes of the present invention also include RNA 
endoribonucleases (hereinafter "Cech-type ribozymes") such as 

15 the one which occurs naturally in Tetrahymena Thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been 
extensively described by Thomas Cech and collaborators (Zaug, 
et al., 1984, Science, 224:574-578; Zaug and Cech, 1986, 
Science, 231:470-475; Zaug, etal., 1986, Nature, 324:429- 

20 433; published International patent application No. WO 
88/04300 by University Patents Inc.; Been and Cech, 1986, 
Cell, 47:207^216). The Cech endoribonucleases have an eight 
base pair active site which hybridizes to a target RNA 
sequence whereafter cleavage of the target RNA takes place. 

25 The invention encompasses those Cech-type ribozymes which 
target eight base-pair active site sequences that are 

exclusive to pl90. 

In the case of oligonucleotides that hybridize to and 
form triple helix structures at the 5* terminus of the pl90 

30 gene and can be used to block transcription, it is preferred 
that they be complementary to those sequences in the 5' 
terminus of pl90 which are not present in other related 
proteins. However, it is preferred that the sequences not 
include those regions of the pl90 promoter which are even 

35 slightly homologous to that of other known proteins. 

The foregoing compounds can be administered by a variety 
of methods which are known in the art including, but not 
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limited to the use of liposomes as a delivery vehicle. Naked 
DNA or RNA molecules may also be used where they are in a 
form which is resistant to degradation such as by 
modification of the ends, by the formation of circular 
5 molecules, or by the use of alternate bonds including 
phosphothionate and thiophosphoryl modified bonds. In 
addition, the delivery of nucleic acid may be by facilitated 
transport where the nucleic acid molecules are conjugated to 
poly- lysine or transferrin. Nucleic acid may also be 
10 transported into cells by any of the various viral carriers, 
including but not limited to, retrovirus, vaccinia, AAV, and 
adenovirus . 

Alternatively, a recombinant nucleic acid molecule which 
encodes, or is, such antisense, ribozyme, triple helix, or 
IS pl90 molecule can be constructed. This nucleic acid molecule 
may be either RNA or DNA. If the nucleic acid encodes an 
RNA, it is preferred that the sequence be operatively 
attached to a regulatory element so that sufficient copies of 
the desired RNA product are produced. The regulatory element 
20 may permit either constitutive or regulated transcription of 
the sequence. In vivo, that is, within the cells or cells of 
an organism, a transfer vector such as a bacterial plasmid or 
viral RNA or DNA, encoding one or more of the RNAs, may be 
transfected into cells e.g. (Llewellyn et al., 1987, J. Mol. 
25 Biol., 195:115-123; Hanahan et al. 1983, J. Mol. Biol., 

166:557-580). Once inside the cell, the transfer vector may 
replicate, and be transcribed by cellular polymerases to 
produce the RNA or it may be integrated into the genome of 
the host cell. Alternatively, a transfer vector containing 
30 sequences encoding one or more of the RNAs may be transfected 
into cells or introduced into cells by way of 
micromanipulation techniques such as microinjection, such 
that the transfer vector or a part thereof becomes integrated 
into the genome of the host cell. 

35 
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5.7 DRUG flCB fEMXMQ ABSAYB 

The present invention provides a simple in vitro system 
for the screening of drug actions on pl90, which will be 
useful for the development of drugs that modulate the growth, 
5 differentiation or survival of neurons. Assays can be 
performed on living mammalian cells, which more closely 
approximate the effects of a particular sexrum level of drug 
in the body, or on microsomal extracts prepared from the 
cultured cell lines. Studies using microsomal extracts offer 
10 the possibility of a more rigorous determination of direct 
drug/enzyme interactions. 

The pl90-synthesi2ing cell lines are useful for 
evaluating the activity of potential bioactive agents on 
pl90. 

15 The present invention also provides a second mammalian 

cell line which contains a chromosomally integrated, 
recombinant DNA sequence, wherein said DNA sequence expresses 
msunmalian, pl90, and wherein said cell line also preferably 
does not express autologous pl90 activity. This second cell 

20 line is also preferably a primate, murine or human cell line. 

Thus, the present invention also provides a method to 
evaluate. 

The invention also relates to methods for the 
identification of genes, termed "pathway genes", which are 

25 associated with the pl90 gene product or with the biochemical 
pathways which extend therefrom. "Pathway gene", as used 
herein, refers to a gene whose gene product exhibits the 
ability to interact with the pl90 gene product. 

Any method suitable for detecting protein-protein 

30 interactions may be employed for identifying pathway gene 
products by identifying interactions between gene products 
and the pl90 gene product. Such known gene products may be 
cellular or extracellular proteins. Those gene products 
which interact with such known gene products represent 

35 pathway gene products and the genes which encode them 
represent pathway genes. 
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Among the traditional methods which may be employed are 
co-immunoprecipitation, cross linking and co-purification 
through gradients or chromatographic columns. Utilizing 
procedures such as these allows for the identification of 
5 pathway gene products* Once identified^ a pathway gene 

product may be used, in conjunction with standard techniques, 
to identify its corresponding pathway gene. For example, at 
least a portion of the amino acid sequence of the pathway 
gene product may be ascertained using techniques well known 

10 to those of skill in the art, such as via the Edman 

degradation technique (see, e.g., Creighton, 1983, Proteins: 
Structures and Molecular Principles, W.H. Freeman & Co., 
N.Y., pp. 34-49). The amino acid sequence obtained may be 
used as a guide for the generation of oligonucleotide 

15 mixtures that can be used to screen for pathway gene 

sequences. Screening made be accomplished, for example by 
standard hybridization or PGR techniques. Techniques for the 
generation of oligonucleotide mixtures and screening are 
well-known. (See, e.g., Ausubel et al., eds. , 1987-1993, 

20 Current Protocols in Molecular Biology, John Wiley & Sons, 
Inc. New York, and PGR Protocols: A Guide to Methods and 
Applications, 1990, Innis, M. et al., eds. Academic Press, 

Inc . , New York) . 

Additionally, methods may be employed which result in 
25 the simultaneous identification of pathway genes which encode 
the protein interacting with the pl90 gene product. These 
methods include, for example, probing expression libraries 
with labeled protein known or suggested to be involved in 
cardiovascular disease, using this protein in a manner 
30 similar to the well known technic[ue of antibody probing of 
Xgtll libraries. 

One such method which detects protein interactions in 
vivo, the two-hybrid system, is described in detail for 
illustration only and not by way of limitation. One version 
35 of this system has been described (Ghien et al., 1991, Proc. 
Natl. Acad, Sci. USA, 88:9578-9582) and is commercially 
available from Glontech (Palo Alto, CA) . 
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Briefly, utilizing such a system, plasmids are 
constructed that encode two hybrid proteins: one consists of 
the DNA-binding domain of a transcription activator protein 
fused to a known protein, and the other consists of the 
5 activator protein's activation domain fused to an unknown 
protein that is encoded by a cDNA which has been recombined 
into this plasmid as part of a cDNA library. The plasmids 
are transformed into a strain of the yeast Saccharomyces 
cerevisiaB that contains a reporter gene (e.g., lacZ) whose 

10 regulatory region contains the activator's binding sites. 
Either hybrid protein alone cannot activate transcription of 
the reporter gene: the DNA-binding domain hybrid because it 
does not provide activation function and the activation 
domain hybrid because it cannot localize to the activator's 

15 binding sites. Interaction of the two proteins reconstitutes 
the functional activator protein and results in expression of 
the reporter gene, which is detected by an assay for the 
reporter gene product. 

The two-hybrid system or related methodology may be used 

20 to screen activation domain libraries for proteins that 

interact with the pl90 gene product, herein also called the 
known "bait" gene protein. Total genomic or cDNA sequences 
may be fused to the DNA encoding an activation domain. Such 
a libraxry and a plasmid encoding a hybrid of the bait gene 

25 protein fused to the DNA-binding domain may be cotransf ormed 
into a yeast reporter strain, and the resulting transf ormants 
may be screened for those that express the reporter gene. 
These colonies may be purified and the library plasmids 
responsible for reporter gene expression may be isolated. 

30 DNA sequencing may then be used to identify the proteins 
encoded by the library plasmids* 

For example, and not by way of limitation, the bait gene 
may be cloned into a vector such that it is trans lationally 
fused to the DNA encoding the DNA-binding domain of the GAL4 

35 protein. 

A cDNA library of the cell line from which proteins that 
interact with bait gene are to be detected can be made using 
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methods routinely practiced in the art. According to the 
particular system described herein, for example, the cDNA 
fragments may be inserted into a vector such that they are 
trans lationally fused to the activation domain of GAL4 . This 
5 library may be co-trans formed along with the bait gene-GAL4 
fusion plasmid into a yeast strain which contains a lacZ gene 
driven by a promoter which contains the GAL4 activation 
sequence. A cDNA encoded protein, fused to the GAI«4 
activation domain, that interacts with bait gene will 

10 reconstitute an active GAIi4 protein and thereby drive 

expression of the lac^ gene. Colonies which express lacZ may 
be detected by their blue color in the presence of X-gal, 
The cDNA may then be purified from these strains, and used to 
produce and isolate the bait gene-interacting protein using 

15 techniques routinely practiced in the art. 

Once a pathway gene has been identified and isolated, it 
may be further characterized as, for example, discussed 
herein. 

The proteins identified as products of pathway genes may 
20 be used to modulate pl90 gene expression, as defined herein, 
or may themselves be targets for modulation to in turn 
modulate symptoms associated with pl90 expression. 



5.8 COMPOUNPg IPBNTi yiED IN TOE SCREENS 

25 The compounds identified in the screen will demonstrate 

the ability to selectively modulate the expression of pl90. 
These compounds include but are not limited to nucleic acid 
encoding pl90 and homologues, analogues, and deletions 
thereof, as well as antisense, ribozyme, triple helix, 

30 antibody, and polypeptide molecules and small inorganic 
molecules. 



5.9 PHARMACEUTZCAIi FORMUIATXON8 
AND ROUTES OF A ^MTKI STRATI ON 

Any of the identified compounds can be administered to 
an animal host, including a human patient, by itself, or in 
pharmaceutical compositions where it is mixed with suitable 
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carriers or excipient(s) at doses therapeutically effective 
to treat or ameliorate a variety of disorders, including 
those characterized by insufficient, aberrant, or excessive 
pl90 activity or neurite growth, differentiation or survival, 

5 including but not limited to: ALS; general ataxia; 

Parkinson's disease; Alzheimer's disease; Huntington's 
disease; general neuropathy; cerebral palsy; neurologic 
trauma; and mental retardation. A therapeutically effective 
dose further refers to that amount of the compound sufficient 
10 to result in cunelioration of symptoms associated with such 
disorders. Techniques for formulation and administration of 
the compounds of the instant application may be found in 
"Remington's Pharmaceutical Sciences,** Mack Publishing Co., 
Easton, PA, latest edition. 

15 A number of disorders may be characterized by 

insufficient, aberrant, or excessive pl90 activity. In 
addition, several physiological states which may, from time 
to time be considered undesired, may also be associated with 
pl90 activity. By way of example, but not by way of 

20 limitation, such disorders and physiological states which may 
be treated with the compounds of the invention include but 
are not limited to those characterized by insufficient, 
aberrant, or excessive neurite growth, differentiation or 
survival, including but not limited to: ALS; general ataxia; 

25 Parkinson's disease; Alzheimer's disease; Huntington's 
disease; general neuropathy; cerebral palsy; neurologic 
trauma; and mental retardation. 

The compounds of the invention may be designed or 
administered for tissue specificity. If the compound 

30 comprises a nucleic acid molecule, including those comprising 
an expression vector, it may be linked to a regulatory 
sequence which is specific for the target tissue, such as the 
brain, skin, joints, bladder, kidney, liver, ovary, etc. by 
methods which are known in the art including those set forth 

35 in Hart, 1994, Ann. Oncol., 5 Suppl 4: 59-65; Dahler et al., 
1994, Gene, 145: 305-310; DiMaio et al. , 1994, Surgery, 
116:205-213; Weichselbaum et al.. Cancer Res., 54:4266-4269; 
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Harris et al. , 1994, Cancer, 74 (Suppl. 3 ): 1021-1025 ; 
Rettinger et al., Proc. Nat'l. Acad. Sci. USA, 91:1460-1464; 
and Xu et al, Exp. Hematol., 22:223-230; Brigham et al., 
1994, Prog. Clin. Biol. Res., 388:361-365. The compounds of 
5 the invention may be targeted to specific sites of 

inflammation by direct injection to those sites, such as 
joints, in the case of arthritis* Compounds designed for use 
in the central nervous system should be able to cross the 
blood brain barrier or be suitable for administration by 
10 localized injection. Similarly, compounds specific for the 
bladder can be directly injected therein. Compounds may also 
be designed for confinement in the gastrointestinal tract for 
use against disorders such as colorectal carcinoma* In 
addition, the compounds of the invention which remain within 
15 the vascular system may be useful in the treatment of 
vascular inflammation which might arise as a result of 
arteriosclerosis, balloon angioplasty, catheterization, 
myocardial infarction, vascular occlusion, and vascular 
surgery and which have already been associated with pl90 by 
20 Pritchard et al., 1994, J. Biol. Chem. , 269, 8504-8509. Such 
compounds which remain within the bloodstream may be prepared 
by methods well known in the art including those described 
more fully in Mclntire, 1994, Annals Biomed. Engineering, 
22:2-13. 

25 

5.9.1 EFFECTIV E DOSAGE 

Pharmaceutical compositions suitable for use in the 
present invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve 

30 its intended purpose. More specifically, a therapeutically 
effective amount means an amount effective to prevent 
development of or to alleviate the existing symptoms of the 
sxxbject being treated. Determination of the effective 
amounts is well within the capability of those skilled in the 

35 art, especially in light of the detailed disclosure provided 
herein. 
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For any compound used in the method of the i 
the therapeutically effective dose can be estimated initially 
from cell culture assays. For example, a dose can be 
formulated in animal models to achieve a circulating 
5 concentration range that includes the IC50 (the dose where 
50% of the cells show the desired effects) as determined in 
cell culture. Such information can be used to more 
accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount 
10 of the compound that results in amelioration of symptoms or a 
prolongation of survival in a patient. Toxicity and 
therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultxires or 
experimental animals, e.g., for determining the LD50 (the 
15 dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population) . The 
dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio 
between LD50 and ED50. Compounds which exhibit high 
20 therapeutic indices are preferred. The data obtained from 
these cell cultxire assays and animal studies can be used in 
formulating a range of dosage for use in human. The dosage 
of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little 
25 or no toxicity. The dosage may vary within this range 
depending upon the dosage form employed and the route of 
administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual 
physician in view of the patient's condition. (See e.g. 
30 Fingl et al., 1975, in "The Pharmacological Basis of 

Therapeutics", Ch. 1 pi) . Dosage amount and interval may be 
adjusted individually to provide plasma levels of the active 
moiety which are sufficient to maintain the desired effects. 
In cases of local administration or selective uptake, 
35 the effective local concentration of the drug may not be 
related to plasma concentration. 
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The amount of composition administered will, of course, 
be dependent on the subject being treated, on the subject's 
weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician* 



AMD PORMUIATION 



The pharmaceutical compositions of the present invention 
may be manufactured in a manner that is itself known, e.g., 
by means of conventional mixing, dissolving, granulating, 
XO dragee-medcing , levigating, emulsifying, encapsulating, 
entrapping or lyophilizing processes. 

Phaz-maceutical compositions for use in accordance with 
the present invention thus may be formulated in conventional 
manner using one or more physiologically acceptable carriers 
15 comprising excipients and auxiliaries which facilitate 

processing of the active compounds into preparations which 
can be used pharmaceutical ly. Proper formulation is 
dependent upon the route of administration chosen. 

For injection, the agents of the invention may be 
20 formulated in aqueous solutions, preferably in 

physiologically compatible buffers such as Hanks 's solution. 
Ringer's solution, or physiological saline buffer. For 
transmucosal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such 
25 penetrants are generally known in the art. 

For oral administration, the compounds can be formulated 
readily by combining the active compounds with 
pharmaceutically acceptable carriers well known in the art. 
Such carriers enable the compounds of the invention to be 
30 formulated as tablets, pills, dragees, capsules, liquids, 
gels, syrups, slurries, suspensions and the like, for oral 
ingestion by a patient to be treated. Pharmaceutical 
preparations for oral use can be obtained solid excipient, 
optionally grinding a resulting mixture, and processing the 
35 mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable 
ipients are, in particular, fillers such as sugars. 
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including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat 
starch, rice starch, potato starch, gelatin, gxiin tragacanth, 
methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
5 carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP) . 
If desired, disintegrating agents may be added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or 
a salt thereof such as sodi\im alginate* 

Dragee cores are provided with suitable coatings. For 

10 this purpose, concentrated sugar solutions may be used, which 
may optionally contain gum arable, talc, polyvinyl 
pyrrolidone, carbopol gel, polyethylene glycol, and/or 
titanium dioxide, lacquer solutions, and suitable organic 
solvents or solvent mixtures. Dyestuffs or pigments may be 

15 added to the tablets or dragee coatings for identification or 
to characterize different combinations of active compound 



Pharmaceutical preparations which can be used orally 
include push-fit capsules made of gelatin, as well as soft, 

20 sealed capsules made of gelatin and a plasticizer, such as 
glycerol or sorbitol. The push-fit capsules can contain the 
active ingredients in admixture with filler such as lactose, 
binders such as starches, and/or lubricants such as talc or 
magnesiiim stearate and, optionally, stabilizers. In soft 

25 capsules, the active compounds may be dissolved or suspended 
in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols. In addition, stabilizers may be 
added. All formulations for oral administration should be in 
dosages suitable for such administration. 

30 For buccal administration, the compositions may take the 

form of tablets or lozenges formulated in conventional 
manner . 

For administration by inhalation, the compounds for use 
according to the present invention are conveniently delivered 
35 in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, 
e.g. , dichlorodif luoromethane, trichlorof luoromethane, 
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dichlorotetraf luoroethane, carbon dioxide or other suitable 
gas. In the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of e.g. gelatin for use in 
5 an inhaler or insufflator may be formulated containing a 

powder mix of the compound and a suitable powder base such as 

lactose or starch. 

The compounds may be formulated for parenteral 
administration by injection, e.g., by bolus injection or 

XO continuous infusion. Formulations for injection may be 

presented in tinit dosage form, e.g., in ampoules or in multi- 
dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 

15 formulatory agents such as suspending, stabilizing and/or 

dispersing agents. 

Pharmaceutical formulations for parenteral 
administration include aqueous solutions of the active 
compounds in water-soluble form. Additionally, suspensions 
20 of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or 
liposomes. Aqueous injection suspensions may contain 

25 substances which increase the viscosity of the suspension, 
such as sodium carboxymethyl cellulose, sorbitol, or dextran. 
Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the 
compounds to allow for the preparation of highly concentrated 

3 O SO lut ions . 

Alternatively, the active ingredient may be in powder 
form for constitution with a suitable vehicle, e.g., sterile 
pyrogen-free water, before use. 

The compounds may also be formulated in rectal 
35 compositions such as suppositories or retention enemas, e.g., 
containing conventional suppository bases such as cocoa 
butter or other glycerides. 
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In addition to the formulations described previously, 
the compounds may also be formulated as a depot preparation. 
Such long acting formulations may be administered by 
implantation (for example subcutaneous ly or intramuscularly) 
5 or by intramuscular injection. Thus, for example, the 
compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble 
10 salt. 

A pharmaceutical carrier for the hydrophobic compounds 
of the invention is a cosolvent system comprising benzyl 
alcohol, a nonpolar surfactant, a water-miscible organic 
polymer, and an ac[ueous phase. Naturally, the proportions of 
15 a co-solvent system may be varied considerably without 
destroying its solubility and toxicity characteristics. 
Furthermore, the identity of the co-solvent components may be 
varied. 

Alternatively, other delivery systems for hydrophobic 
20 phzurmaceutical compounds may be employed. Liposomes and 
emulsions are well known examples of delivery vehicles or 
carriers for hydrophobic drugs. Certain organic solvents 
such as dimethylsu If oxide also may be employed, although 
usually at the cost of greater toxicity. Additionally, the 
25 compounds may be delivered using a sustained-release system, 
such as semipermeable matrices of solid hydrophobic polymers 
containing the therapeutic agent. Various of sustained- 
release materials have been established and are well known by 
those skilled in the art. Sustained-release capsules may, 
30 depending on their chemical nature, release the compounds for 
a few weeks up to over 100 days. Depending on the chemical 
nature and the biological stability of the therapeutic 
reagent, additional strategies for protein stabilization may 
be employed. 

35 The pharmaceutical compositions also may comprise 

suitable solid or gel phase carriers or excipients. Examples 
of such carriers or excipients include but are not limited to 
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calcium carbonate, calcium pho&phat:e, various sugars, 
starches, cellulose derivatives, gelatin, and polymers such 

as polyethylene glycols. 

Many of the compounds of the invention may be provided 
5 as salts with pharmaceutical ly compatible counterions. 
Pharmaceutically compatible salts may be formed with many 
acids, including but not limited to hydrochloric, sulfuric, 
acetic, lactic, tartaric, malic, succinic, etc. Salts tend 
to be more soluble in aqueous or other protonic solvents that 
10 are the corresponding free base forms* 

5.9.3 ROUTES OF ADMINrB TOATIOK 

Suitable routes of administration may, for example, 
include oral, rectal, transmucosal, transdermal, or 
15 intestinal administration; parenteral delivery, including 
intramuscular, subcutaneous, intramedullary injections, as 
well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal, intranasal, or intraocular injections. 

Alternately, one may administer the compound in a local 
20 rather than systemic manner, for example, via injection of 
the compound directly into an affected area, often in a depot 
or sustained release formulation. 

Furthermore, one may administer the drug in a targeted 
drug delivery system, for example, in a liposome coated with 
25 an antibody specific for affected cells. The liposomes will 
be targeted to and taken up selectively by the cells. 

5.9.4 PACKAGING 

The compositions may, if desired, be presented in a pack 
30 or dispenser device which may contain one or more unit dosage 
forms containing the active ingredient. The pack may for 
example comprise metal or plastic foil, such as a blister 
pack. The pack or dispenser device may be accompanied by 
instructions for administration. Compositions comprising a 
35 compound of the invention formulated in a compatible 

pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labelled for treatment of an 
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indicated condition. Suitable conditions indicated on the 
label may include treatment of a disease such as one 
characterized by insufficient, aberrant, or excessive neurite 
growth, differentiation, or survival. 

5 

6. EXMCPXiE: THE IMTERACTZON BETWEEN 

CONTACTXM AMD TEE CAE 
DOMAIK OF RPTPfl 



The subsections below describe the 
interaction between contactin and the CAH domain of RPTP^. 
The data demonstrate that ligands for RPTP/S are 
differentially expressed in. neuronal and glial cell lines. 
In addition, it is shown that a 140 kDa protein from these 
cell lines interacts with the CAH domain of RPTP/3, and that 
this 140 kDa protein is contactin. The data also demonstrate 
^5 that RPTP/S interacts with both membrane-bound and soluble 
contactin. 

6.1* MATERIALS AND METHODS 



6.I.I. CEIii, CUL TURE 

SF763T and SF767T htiman astrocytoma cell lines were 
grown in athymic nu/nu mice to create a tumor derived cell 
line. The parental lines (SF763 and SF767) were generously 
provided by Dr. Michael E. Bernes (The Barrow Neurological 
Institute, Phoenix, Arizona). All other cell line used were 
supplied by the American Type Culture Collection (Rockville, 
MD) . For culturing of rat sensory neuron, spinal sensory 
ganglia were dissected from newborn rat pups and dissociated 
by incubation with trypsin (0.05% for 10 minutes). The 
ganglia were washed several times in LIS + 10% fetal calf 
serum, and triturated with a pasteur pipette. The resulting 
single cell suspension was not subjected to preplating. The 
cells were plated at l5,OO0 cells per well in an eight -well 
chamber slide (Nunc) precoated with 10 mg/ml laminin in PES. 
The medium was LlS/COj with supplements as described (Hawrot 
and Patterson, 1979, Meth. Enzymol., 58:547-584), and nerve 
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growth factor was added at 50 ng/ml. The cells were cultured 
for two days prior to staining* 

6.1.2. GENERATION AND PRODUCTION 
^ OF FC^PUSIONS 

To construct the Fc-fusion molecule, different 
subdomains of RPTP/S extracellular region were amplified using 
pfu (Stratagene, La Jolla, CA) and cloned into a unique BamHI 
site upstream from the hinge region of himan IgGl-Fc. For 
the construction of iSC and fiCF fusions a DMA fragment was 
amplified from position -20, within the Bluescript sequence 
to position 939 and 1245 respectively (/3C-Fc aa 1-313, /5CF-Fc 
aa 1-415) (Levy et al,, 1993, J. Biol. Chem. , 268:10573- 
10581) . In frame fusion was made by creating a BamHI site in 
the 3« primer maintaining the original amino acids sequence 
in the fusion junction. These fragments were further cloned 
into Hindlll-BamHI linearized PC7I vector, a modified version 
of pIGl that contained a cDNA form instead of the genomic 
fragment of htuaan IgG (Simmons, 1993, in Cellular 
Interactions in Development. A Practical Approach, Hartley 

20 

(ed.), IRL Press). The same strategy was used to constiruct 
hiiman contactin-Fc (Hcon-Fc) fusion molecule. Briefly, total 
RNA was prepared from Y79 retinoblastoma cells and converted 
to single strand cDNA using Superscript II reveres 
transcriptase (Gibco-BRL) following the suppliers protocol. 
This cDNA was use as a template to clone human contactin by 
three overlapping PGR reactions into EcoRI-BamHI sites of 
PC7I vector. In order to use these sites, the EcoRI site at 
position 3173 (Reid et al., 1994, Brain Res. Mol. Brain Res., 
21:1*8) , was eliminated by changing a single base during the 

30 

PGR reaction. The final construct contained amino acids 1- 
1020 of human contactin fused to the IgG region. To 
construct jSF-Fc the region between nucleotides 901 to 1242 
was amplified with a set of primers that introduced SacII and 
BamHI sites in the ends of the fragment. This fragment was 

35 

cloned into pCN^l between the globulin gene and a sequence 
encoding a signal peptide derived from TGF/3 gene (Plowman et 
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al., 1992, J. Biol. Chem. , 267:13073-13078)- The integrity 
of all the above constructs was checked by complete 
nucleotide sequence determination or by restriction enzyme 
analysis. Fusion proteins were produced transiently in COS7 
5 cells or by cotransf ection with pN1012-Neo into 293 cells and 
selecting for individual G418 resistant clones as described 
(Peles et al.^ 1991, EMBO J., 10:2077-2086). Purification of 
fusion proteins was achieved by affinity chromatography on 
Protein-A Sepharose CL 43 (Pharmacia) . Bound proteins were 

10 eluted with 100 mM sodium citrate PH 2,5, IM MgCl^, followed 
by buffer exchange on a PD-10 desalting column (Pharmacia) . 
The proteins were analyzed by gel electrophoresis followed by 
silver staining (ICN, Costa Mesa, CA) . Concentration of the 
purified proteins was determined by bradford reagent (BioRad, 

15 Richmond, CA) , and by an ELXSA assay using peroxidase coupled 
antibody against human IgG (Pierce, Roxford, IL) . The same 
antibody was used to detect the fusion proteins by western 
blotting followed by chemiltiminescence reagent (ECL; 
Amersham) as described previously (Peles et al., 1992, Cell, 

20 69:205-216) • 

e.1.3. EXPRESBION CItONING IN CO S CELItS 

Total cellular RNA was prepared from GH3 cells using 
acid guanidinium thiocyanate extraction (Chomczynski and 

25 Sacchi, 1987, Anal. Biochem. , 162:156-159), and Poly (A) RNA 
was isolated by two passages over an oligo dT cellulose 
column (Pharmacia) . cDNA was synthesized using the 
Superscript kit (Gibco BRL, Bethesda, MD) by priming with a 
random primer that contained a Hindlll site. Following the 

30 addition of EcoRI adaptors the double-stranded cDNA was size 
selected on agarose gel. cDNAs larger then 2 kb were ligated 
into a EcoRl and Hindlll-digested pcMPl plasmid vector, a 
derivative of the pCMV-1 vector (Lammers et al. , 1993, J. 
Biol. Chem. , 168:24456-22462). E. coli DHIOB cells (GIBCO 

35 BRL) were transformed by electroporation REF. This procedure 
generated a cDNA library with 2 X 10^ independent clones. 
Pools of 3000 bacterial clones were grown for 24 hours and 
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scraped from plat:es using, LB containing 15% glycerol. 
Twenty percent of the cultures were saved as glycerol stocks 
at -70«C and plasmid DNA was prepared from the rest using the 
Wizard plasmid purification kit (Promega) . 
5 Plasmid DNA (10 fig) was transfected into COS7 cell grown 

on chamber slides (Nunc) with lipofectamin (GIBCO BRL) . 
After 72 hours cells were incubated for one hour with meditin 
containing 0.5 fig/ml /8CF-Fc. Unbound Fc-fusion proteins were 
removed by three washes with cold DMEM/F12 and the cells were 

XO fixed with 4% paraf ormaldehyd in PBS. Immunostaining was 
performed with ABC staining system (Vector Lab) , using 
biotinylated anti-human IgG antibodies (Fc specific; Jackson 
Labs, West Grove, PA) following by streptavidin alkaline 
phosphatase and NBT/BCIP as substrate according to the 

15 protocol provided by the manufacturer. One positive pool 

(#54) was subdivided and rescreened until a single clone (F8) 



DNA sequence determination was carried out using the 
dideoxy-chain termination method (Sanger et al., 1977, Proc 
20 Natl. Acad. Sci., USA 74:5463), with Sequenase 2.0 (United 
States Biochemical Corporation, Cleveland, OH) . Sequencing 
was performed on both strands by priming with synthetic 



25 6.1.4. COH8TRUCTIOK OP RPTP/J/BOP- 

pgCEPTOR CHIMERAS 

To generate a plasmid for the expression of 0CF/EK 
chimeras, a portion of the extracellular domain of RPTP<3 
containing the CAH and the FINIII domains 0CF. aa 1-418) was 
fused to the human EGF receptor at position 634, twelve amino 
acids after the transmembrane domain in its extracellular 
region. These fragments were amplified using pfu 
(Stratagene, La Jolla, CA) with a specific set of primers 
that introduce a BstBI site at the junction between the two 
genes. The resulting fragments were ligated into Bluescript 
(Stratagene, La Jolla, CA) . Proper fusion between the two 
molecules was verified by nucleotide sequence analysis. This 
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Chimeric gene was then subcloned into a NotI site in the 
reteroviral vector SRa-SL and viral stocks where prepared by 
cotransfecting COS-7 cells with this vector along with a 
helper virus plasmid (Muller et al., 1991, Mol. Cell, Biol., 
5 11:1785-1792). These viruses where used to infect NIH 3T3 
(clone 2.2), which lack endogenous EGF -receptor . Following 
infection, cells where selected in a medium containing Img/ml 
G418 (Gibco-BRL) and resistant colonies were individually 
grown and assayed for the expression of the chimeric receptor 
10 by Western blotting with antibodies against the carboxyl 
terminus of the EGF-R (Kris et al., 1985, Cell, 40:619-625) 
as described previously (Peles et al., 1992, Cell, 69:205- 
216) . 

15 6.1.5. BINDING OF FC-FUSION PROTEINS 

Confluent monolayer of cells were incubated for one hour 
with conditioned mediim containing 0.25-0 ,5 mg/ml Fc-fusion 
protein. The unbovmd proteins were removed by three washes 
with binding medium (0.1% BSA, 0.2% none fat dry milk in 

20 DMEM/F12) and the cells were further incubated with 1 ng/ml 

-Protein A (Amersham) , for 30 minutes at 4»C. Plates 
were washed three times with cold binding medium and cell 
bound radioactivity was determined as described previously 
(Peles et al., 1993, EHBO J., 12:961-971). Cellular staining 

25 using the Fc-fusion proteins was done using the procedure 
described above for expression cloning. 

6.1.6. CHEMICAI* CROSSLINKING EXPERIMENTS 

Cells were incubated for four hours with medium 
30 containing, the different Fc-fusion proteins. Following 

three washes with cold PBS/Ca (1 mM CaCls in PBS) , the cells 
were incubated for additional 30 minutes with PBS/Ca 
containing 1 mM DTSSP (3 , 3 • -Dithiobis[sulf osuccinimidyl — 
propionate]. Pierce, Rockford, XL). Free cross-linker was 
35 removed by additional PBS wash followed by quenching with 100 
mM glycine in TBS for 10 minutes at 4«C. Cell lysates were 
made in SBN lysis buffer (Peles et al., 1991, EMBO J., 
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10:2077-2086), and sepharose-protein A was added to the 
cleared lysates. Following two hours incubation at 4*C, the 
beads were washed three times with HNTG buffer (Peles et al., 
1991, EMBO J., 10:2077-2086), and the bound proteins were 
5 eluted by adding SDS PAGE sample buffer containing 5% /S- 
mercaptoethanol and further incubation for 10 minutes at 
95«C. 



6.1.7. 



SEOPENCIKG 



iO Cellular membranes were prepared from 5X10» GH3 cells by 

homogenization in hypotonic buffer that included 10 mM Hepes 
pH 7.5, 1 mM E6TA, 1 mM MgClj, 10 fig /ml aprotinin, 10 fig/Tal 
leupeptin and 2 mM PMSF. Nuclei and unbroken cells were 
removed by low speed centrifugation (lOOOg x 10 minutes at 
15 4»C), and the supernatant was then subjected to high speed 
centrifugation at 40000g (30 minutes at 4»C) . The membrane 
pellet was resuspended in SML solubilization buffer (2% 
Sodium monolaurate, 2 mM MgC12, 2 mM PMSF in PBS). After one 
hour incubation on ice the detergent-insoluble materials was 
20 removed by centrifugation, and the sample was diluted tenfold 
with PBS containing 2 mM MgCl,. This sample was loaded on a 
column of /8CF-FC bound to Sepharose Protein A (200 /*g /SCF- 
Fc/ml beads) at 4»C. The column was washed with SML buffer 
containing 0.15% detergent and the bound proteins were eluted 
25 by adding SDS sample buffer and heating to 95»C. Proteins 
were separated on 7.5% gel and electroblotted in CAPS buffer 
(100 mM CAPS, 10% MeOH) to ProBlott membrane (Applied 
Biosystems) . The membrane was stained with coomassie R-250 
and the 140 kDa band was excised and subjected to direct 
30 microsequencing analysis. Microsequencing was performed with 
an Applied Biosystems Model 494 sequencer, run using standard 
reagents and programs from the manufacturer. 

TO obtain internal peptide sequence the blotted band was 
moistened with neat acetonitrile, then reduced by the 
35 addition of 200 ul of 0.1 M Tris pH 8.5, 10 mM 

dithiothreitol, 10% acetonitrile. After incubation at 55 
for 30' the sample was cooled to room temperature and 20 ul 
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of 0.25M 4-vinylpyridine in acetonitrile added. After 30 
minutes at room temperature the blots were washed 5 times 
with 10% acetonitrile. Digestion was performed for 16 hours 
with 1 ug modified trypsin (Promega) in 50 ul of O.IM Tris pH 
5 8.0, 10% acetonitrile, 1% octylglucoside. Digestion was 
stopped by the addition of 2 ul of neat trif luoroacetic acid 
(TFA) . Peptides were separated on a 1 mm x 200 mm Reliasil 
C-18 reverse phase coliimn on a Michrom UMA HPLC run at 50 ul 
per minute. Solvents used were 0.1% TFA in water and 0.085% 
10 TFA in 95% acetonitrile/ 5% water. A linear gradient of 5 to 
65% B was run over 60 minutes. Absorbance was monitored at 
214 nm and pes^cs were collected manually into a 96 well 
polyethylene microtitre plate. Purified peptides were 
sequenced as described above. 

15 

6.I.8. TREATMENT WITH PI -PLC 

Cells grown to conf luency in 90 mm dishes were 
metabolically labeled with 100^ Ci/ml ["S] -methionine and 
cysteine mix (NEN, Boston, MA) for four hours at 37 ^C* 

20 Labeled cells were washed three times with MEM and incubated 
with 250 mU of phosphatidyl inositol specific phospholipase C 
(PI-PLC, Boehringer Mannheim or a kind gift from Dr. J. 
Salzer) for 50 minutes at 37**C. The supernatant was 
collected and cleared by centrifugation (lOOOg), membranes 

25 were prepared from the cells and further solubilized in SML 
buffer as described above. /SCF-Fc bound to Sepharose-protein 
A beads was added to the supernatant and the membrane 
fractions for one hour at <^C. The beads were washed twice 
with 0.15% soditim monolaurate in PBS and once in PBS before 

30 the addition of SDS sample buffer. The precipitated proteins 
were separated on 7.5% cell and subjected to autoradiography. 

For binding experiments, cell were treated with 
different amounts of PI-PLC (as indicated in the legend to 
the figures) in MEM containing 0.5% BSA for 30-60 minutes at 

35 37 ^C. Cells were briefly washed and binding of iSCF-Fc was 
performed as described above. 
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6.2.1. THE CXB. DOMAIM OP RPTP/3 MEDIATES 

AM IKTER ftCTTOM WTTH NEDRON8 

TO identify cellular ligands for RPTP/3, fusion proteins 
were constructed between different subdomains of RPTP/J and 
the Fc portion of human IgG. Three chimeric constructs were 
made, one containing both the carbonic anhydrase and the 
fibronectin domains (/8CF-Fc) and two others carrying each 
domain by itself (/SC-FC or /5F-FC) . Initially, 0CF-Fc was 
used to screen for a membrane bound ligand on the surface of 
different neuronal and glial cell lines. Several cell lines 
that bind this fusion protein were identified. These were 
the IMR-32 neuroblastoma cells, the two closely related 
neuroendocrine derived cell lines GH3 and GHl, and five 
different glioblastoma cell lines. 

The fact that these positive cell lines were derived 
from glial and neuronal origins raised the possibility that 
RPTP/S may interact with two different membrane-associated 
ligands. Alternatively, a single ligand may exist which is 
expressed by both neurons and glia cells. To explore these 
two possibilities it was examined whether a fusion protein 
that contained only the CAH domain of RPTP^ (3c-Fc) will 
retain the same cell specificity observed with /SCF-FC. It 
was reasoned that in a multidomain receptor like RPTP^, each 
domain might function as an independent unit in terms of its 
interaction with a specific ligand. Thus, the use of a 
single domain in binding experiments might allow the 
identification of a cell type specific ligand. As depicted 
in Fig.2A, this fusion protein, indeed, binds to the same 
neuronal and neuroendocrine cell lines. In contrast, none of 
the glioblastomas were positive, suggesting that there are at 
least two ligands for RPTP^ that are differentially expressed 
on neuronal or glial cells. This result also implied that 
the CAH domain mediates the interaction of RPTP^ with a 
specific ligand present in neurons but not in glia cells. 
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Accordingly, if the binding of iSC-FC to netironal ligand 
reflects the interactions occurring in vivo, one would expect 
to see similar binding specificity on cultures of primary 
neurons. The binding of the different fusion proteins to 
5 cultured dorsal root ganglion cells (DRG) , followed by 
detection of the bound proteins by ixnmunostaining, was 
analyzed. fiC-FC and /SCF-FC bound to GH3 cells, as well as to 
the primary neurons. A fusion protein containing the 
fibronectin domain alone ()8F-Fc) failed to bind to either GH3 

10 cells or DRG neurons. In other experiments, binding of /3F-FC 
to several glial cell lines was detected, but no binding of 
this domain to neuronal derived cell lines or neurons derived 
from rat DRGs and chick cortex was detected. In addition, it 
was examined whether the binding specificity observed with 

15 the CAH domain of RPTP/8 is unique to this receptor by 

comparing it with the related phosphatase RPTP7 (Barnea et 
al., 1993, Mol. Cell. Biol., 13:1497-1506). A fusion protein 
made with the CAH domain of this highly homologous family 
member did not bind to GH3 cells or to primary netirons. 

20 Altogether these results suggests that specific ligands 

for RPTP/S exist on the surface of cells from neuronal and 
glial origin. Different subdomains of the receptor mediate 
its interaction with those distinct ligands. The CAH 
mediates an interaction with neurons while the FNIII enables 

25 the interaction of RPTP/S with glia cells. In the work 
presented here, the identification and molecular 
characterization of the ligand for the CAH domain is 



30 6.2.2. COVALENT CROSSLIMKZHG SXPERZMENTS 

REVEAIi A 140 KDA PROTEIN THAT INTERACTS 
WITH THE CAH DOMA IN OF RPTPfl 

To characterize ligands for RPTP/3, a reversible cross- 
linker (DSSTP) was used, and proteins were sought that 
specifically bound to jSc-Fc. Two of the cell lines that 
bound /5C-Fc (IMR32 and GH3) , as well as COS7 cells as a 
control, were allowed to react with the fusion proteins 
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containing the FNIII or the CAH domains followed by cross- 
linking and precipitation of the complexes. As shown in Pig. 
3, a protein of about 140 kilodalton specifically reacted 
with fiC-Fc in the rat GH3 and hijunan lMR-32 cells. No 
5 reactivity was detected in control cells or in cells 
incubated with /JF-Fc. The cross-linker (DSSTP) used, 
undergoes cleavage in the reducing SDS PAGE conditions and, 
therefore, permits the identification of the true molecular 
weight of the putative ligand. This result suggested that 
10 the same ligand is expressed in the rat GH3 and the hximan 
IMR-32 lines. 

6.2.3. MOLECULAR CLONIHG OF A CAMDZOATE 
LZOAIID FOR RPTP/3 FROM RAT 
GR3 CELLS REVEALS THE RAT 
15 gOMOLOQU g OF COMTAGTIM 

An expression cloning strategy was employed in an effort 
to clone the gene that encodes the 140 kDa candidate ligand. 
we have employed. Plasmid pools made from a GH3-cDNA library 
were transfected into COS7 cells and the cells were screened 
20 for their ability to bind /SCF-Fc. Positive cells were 

detected by immunostaining with biotinylated anti-human IgG 
antibodies and sterptavidin alkaline phosphatase. One 
positive pool was identified that when transfected yielded 
several stained cells on the slide. This pool was subdivided 
25 and rescreened four times until a single clone (F8) was 
isolated. Transfection of COS7 cells with this plasmid 
resulted in positive staining of approximately 25%-5G% of the 
cells, a number that correlates well with the maximum 
transfection efficiency in our system. DNA sequence analyses 
30 of clone F8 showed that it contained a 3.9 kb insert and a 
single long open reading frame of 3063 nucleotides. The 
deduced 1021 amino acid sequence encoded by this clone has 
been presented elsewhere. Data bank search with this 
sequence showed that it shares 95% and 99% identity at the 
3S amino acid level with human and mouse contactin respectively 
(Berglund and Ranscht, 1994, Genomics, 21:571-582; Gennarini 
et al., 1989, J. cell. Biol., 109:755-788; Reid et al. , 1994, 
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Brain Res. Mol- Brain Res., 21:1-8). It was therefore 
concluded that the ligand for RPTP/S cloned from GH3 cells is 
the rat homologue of contactin. Structurally, this protein 
consists of six C2 type Ig domains, four fibronectin type III 
5 repeats and an hydrophobic region that mediates its 

attachment to the membrane by a GPI linkage (Gennarini et 
al., 1989, J. cell. Biol., 109:755-788; Reid et al., 1994, 
Brain Res. Mol. Brain Res., 21:1-8). Functionally, it is a 
neural cell adhesion molecule that has been suggested to play 
10 a morphogenic role during the development of the nervous 
system (Rathjen et al., 1987, J. Cell. Biol., 104:343-353; 
Walsh and Doherty, 1991, Cell. Biol. Int. Rep., 15:1151- 
1166) • 

In parallel to the expression cloning strategy, and as a 
15 complementary approach, a biochemical procedure was employed 
that utilized the CAH domain as an affinity reagent for 
protein purification. pl40 was purified from solubilized 
membranes prepared from GH3 cells on a column of iSCF-Fc. 
After resolving the eluted protein on SDS/PAGE, the 140 kDa 
20 species was subject directly to N-terminal sequencing, or was 
digested with trypsin. Two peptide secjuences obtained, one 
from the N-texnaiinus and the other from an internal peptide 
after tryptic digest. Both sequences matched the translated 
F8 sequence and confirmed that contactin is indeed a ligand 
25 for the CAH domain of RPTP/3. 

6.2.4. BINDING AKALYfllS OF RPTPB AND CQMTACTXN 

The binding specificity of different subdomains of 
RPTPfi towards contactin was examined. COS7 cells were 

30 transfected with rat contactin (clone F8) and analyzed for 
their ability to bind fusion proteins containing the CAH, 
FNIII or both domains. As expected, expression of contactin 
enabled the binding of the CAH domain of RPTP/9 to the cells. 
The FNIII domain alone did not bind to contactin expressing 

35 cells. In addition, similar results were obtained with a 
fusion protein that carries most of the extracellular region 
of the short form of RPTP/S (aa 1-644; data not shown). 
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The reciprocal interaction, namely, whether soluble 
contactin molecules are able to bind specifically to cells 
expressing RPTP|8, was explored next. In these experiments, 
COS7 cells were transfected with chimeric receptor constructs 
5 that consist of the entire extracellular region of the short 
form of RPTP/5 OCFS/EK) , the CAH domain plus the FNIII repeat 
(/SCF/EK), or the CAH domain alone (/SC/EK) fused to the 
transmembrane and intracellular domains of the EGF receptor. 
A chimeric receptor was used instead of the wild type 
10 phosphatase because the wild type phosphatase was not able to 
be expressed in heterologous cells. Human contactin-Fc 
fusion protein binds to cells transfected with these chimeric 
receptors but not to control cells. Taken together, these 
results demonstrate that expression of contactin is both 
15 necessary and sufficient for binding to the CAH domain RPTP/S. 

6.2.5. SOLUBLE COMTACTIH RELEASED PROM THE 

MEMBRANE BY PH08PH0LIPASB C TREATMENT 
TMTERACTg WITH RPTPg 

contactin belongs to a family of recognition molecules 
*® that TAG-1 and BIG-1, all of which are anchored to the plasma 
membrane via a glycosyl-phosphatidy linos itol (GPI) . 
Therefore, it was of interest to see how phospholipase C (Pl- 
PLC) treatment would effect the interaction between contactin 
and RPTP/3. When incubated with COS7 cells expressing 
contactin (clone F8) , PI-PLC completely abolished the binding 
of /SCF-Fc to the cells. Similar results were obtained also 

with GH3 cells. 

It has been demonstrated that members of this family and 
other GPI -linked proteins may exist either in a membrane 
bound or a secreted soluble form that is released from the 
cell surface (Furley et al., 1990, Cell, 61:157-170; 
Th6veniau et al., 1992, J. Cell. Biochem. , 48:61-72). Hence, 
it was examined whether the different forms of contactin, 
including those released after PI-PLC treatment, could 
interact with RPTP/3. To this aim, GPI-linked proteins were 
released from metabolically labeled GH3 cells with the enzyme 
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and purified contact:in by bioaff inity precipitation from 
membrane extracts of the cells or the cell supernatants . 
Without PI-PLC treatment, two proteins pl40 and pl90 from the 
membrane fraction could specifically associate with /8C-Fc. 
5 These proteins were not present in the supernatant and they 
were not detected with /JF-Fc. However, after PI-PLC 
treatment, pl40/contactin could be precipitated from the 
medium of the cells, indicating that the soluble form 
produced by phospholipase treatment interacts with RPTP/S. 

10 This result may suggest that, in addition to the interaction 
between the membrane bound forms of these proteins, soluble 
contactin could potentially interact in vivo with RPTP/5. /Sc- 
Fc could precipitate the 190 kilodaltons protein only from 
membrane fraction and not from the cell supernatant. PI-PLC 

15 treatment did not release this protein from the cells 

suggesting that it is either an integral membrane protein or 
a cytoskeletal protein associated with contactin complexes. 
Since contactin by itself is sufficient to mediate the 
interaction with RPTP^, the 190 kDa protein may be associated 

20 with contactin in the cells and coprecipitated with it during 
the bioaff inity procedure. One intriguing possibility is 
that pl90 is a signaling unit used by contactin on the 
surface of neurons (see below) . 

25 7. EXAMPI.E: THE CAH DOMAIN OF RPTP/3 

INDUCES CONTAC TIN MEDIATED 
NEURITS OUTGROWTH 

The subsections below describe the induction, by the CAH 
domain of RPTPjS, of contactin mediated neurite outgrowth. It 
is shown that the CAH domain of RPTP/5 is a permissive 
substrate for neuronal adhesion and neurite growth. In 
addition, it is also shown that the neurite growth, 
differentiation and survival induced by the carbonic 
anhydrase-like domain of RPTP0 is mediated by neuronal 
contactin . 
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The materials and methods for this example were the same 
set forth in the example described in section 6.1 
above, except as supplemented or amended below, 

5 

7.1.1. WgnnTTE OOTOROITPH ABBAY8 

Neurite outgrowth assays using IMR 32 cells were 
performed as described previously (Friedlander et al., 1994, 
J. Cell. Biol., 125:669-680) using 35 mm petri dishes coated 
16 with different proteins adsorbed on the sxibstrate. After 
blocking the dishes with 1% BSA/PBS, the blocking solution 
was replaced with 3 X 10* cells suspended in 140 nl of 
DMEM/F12/ITS. Following incubation for 3 hrs at 37 during 
which time most of the cells adhered to the dish, the medium 
15 was removed and replaced with DMEM/FI2/ITS medium containing 
antibodies (Ig fraction purified by ammonium sulfate 
precipitation and DE52 chromatography) . Dishes were 
incubated for 48 hrs and fixed with Hanks/0.3% sucrose 2.5% 
paraformaldehyde. For PI-PLC treatment, primary tectal 
20 neurons (5 X 10* cells/250 ml) were prepared from E9 chick 
embryos (Grumet et al., 1984, Proc. Natl. Acad. Sci. USA, 
81:267-271) and incubated with 0.25 ^1 of PIPLC (1.7 U/ml) in 
DMEM/F12/ITS+ at 37 "C for 30 min. The cell suspension was 
then incubated on dishes coated with different substrates 
25 without changing the medium. 

7.2. RESULTS 

7.2.1. NEURXTE OUTGROirTB INDUCED BY THE 

CAE DOMAIN OF KPTVfi IS MEDIATED 
3® «PmOOgH CQOTPACTIN 



Contact in has been shown to be involved in both positive 
and negative responses of neurons to various stimuli 
(BrUmmendorf and Rath j en, 1993, J. Neurochem. , 61:127-1219). 
When presented as a ligand to neurons, either as a membrane- 
bound or a soluble form, contact in induces axonal growth 
(BrUmmendorf et al., 1993, Neuron, 10:711-727; Clarke et al.. 
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1993, Eur. J- Cell- Biol., 61:108-115; Durbec et al., 1992, 
J. Cell. Biol., 117:877-887; Gennarini et al., 1989, J. Cell. 
Biol., 109:755-788). Its neural receptor has been identified 
as the recognition molecule Nr-CAM (Morales et al., 1993, 
5 Neuron 11:1113-1122). On the other hand, contactin itself is 
a receptor present on neurons and mediates their repulsion by 
the extracellular matrix protein janusin (Pesheva et al., 
1993, Neuron, 10:69-82). The results described in the 
example of Section 6.1 indicate that the CAH domain of 
10 can bind to contactin on cells. To analyze effects of this 
binding on neurons, chick tectal cells, known to express 
contactin, were plated on dishes previously coated with ^CF- 
Fc fusion protein or with Ng-CAM or laminin as controls. 
Cells attached and grow processes on both of these 
15 substrates. Treatment of the cells with PI-PLC prior to 
plating completely abolished cell attachment and neurite 
extension on RPTP/3. In contrast, PI-PLC did not have a 
dramatic effect on cells growing on Ng-CAM or laminin as 
substrate. Thus, it was concluded that the CAH domain of 
20 RPTP^J is a permissive substrate for neuronal adhesion and 
neurite growth. Moreover, the cell adhesion and axonal 
elongation induced by RPTPjS is mediated through a GPI- 
anchored receptor • 

Next it was investigated whether contactin could be the 
25 neuronal receptor for the CAH domain of RPTP/5. To this aim, 
a human neuroblastoma cell line IMR-32 was used that has the 
capacity to differentiate and to elaborate neurites in 
response to different stimuli (LUdecke and Unnsicker, 1990, 
Cancer, 65:2270-2278) • These cells have fibroblastic 
30 morphology when crown on petri dishes coated with 

fibronectin, but on laminin substrates they assume a neuronal 
phenotype and extend processes with growth cones. A similar 
morphologic differentiation was seen after plating the cells 
on the CAH domain of RPTP/S. In contrast, the CAH domain of 
35 RPTP7 had no effect on cell adhesion and differentiation. 
These results show that IMR-32 cells respond specifically to 
the carbonic anhydrase domain of RPTP/3. To determine whether 
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contactin could be acting as a receptor on the IMR-32 cells 
for RPTP/S, the effects of antibodies against contactin on the 
growth of cells on different substrates were tested. 
Antibodies against contactin inhibited the growth of 
5 processes on ^C-Fc and /3CF-Fc but not on laminin. In the 
presence of these antibodies, the lMR-32 cells also retracted 
their processes and many cells lifted off the dish yielding 
fewer cells after 2 days of incubation. No effect was 
observed with control antibodies. Thus, the neurite growth, 
10 differentiation and survival induced by the carbonic 
anhydrase-like domain of RPTP/? is mediated by contactin 
present in the neurons. 

8. example: the CX.ONIM6 OF pX90 AMD THE 

TMTERAGTIOM BETWBgW TT AMP COMTACTIM 



The subsections below describe the purification and 
sequencing of pl90 protein and the subsequent cloning of rat 
and human pl90 cDNA. The interaction between pl90 and 



20 



ifMP METHODS 



gamianeina 



Solubilized membrane lysate was prepared from 3X10' GH3 
cells and loaded on a colximn of /SCF-Fc bound to Sepharose 



protein A (Pharmacia) as described previously (Peles et al., 
1995, Cell, 82:251-260). Bound proteins were separated on 
6.5% SOS gel, blotted to ProBlot membrane (Applied Biosystem, 
inc.) and stained with Coomassie R-250. To obtain internal 
peptide sequence, the blotted 190 JcDa band was moistened with 
neat acetonitrile and then reduced by the addition of 200 ul 
of O.IM Tris pH 8.5, 10 mM dithiothreitol, 10% acetonitrile. 
Digestion was performed for 16 hours with 1 ng modified 
trypsin (Promega) in 50 ul of O.IM Tris pH 8.0, 10% 
acetonitrile, 1% octylglucoside. Digestion was stopped by 
the addition of 2 ul of neat trif luoroacetic acid (TFA) . 
peptides were separated on a l mm x 200 mm Reliasil C-18 
reverse phase column on a Michrom tJMA HPLC run at 50 ul per 
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minute • 

in 95% acetonit:rile/5% 
B was run over 60 minuses, 
nm and peaks were collect:ed 
5 polyethylene microtiter 

( 



0.1% TFA in water and 0.085% TFA 

A linear gradient of 5 to 65 % 
Absorbance was monitored at 214 
manually into a 96 well 

Purified peptides were 

et al., 1995, Cell, 82:251- 



260) 



Cloning of Rat and Huma n CASPR/P190 cDNA 

10 The sequence of one tryptic peptide obtained from the 

purified protein ( QNLPQILEE8 ) was found in a 900 bp EST 
fragment B102/LF98 from the BRCAl region on chromosome 17q21 
(Friedman et al., 1994, Cancer Res., 54:6374-6382). Primers 
corresponding to this region (5' primer: TCXS CAG GCT ATG AGC 

15 CTG GCT ACA TCC; 3« primer: GTG 6GT AGG GGA GGT TTG CTG CCA 
GG) were use for RT-PCR to clone this DNA fragment from rat 
GH3 cells. A 600 bp DNA fragment derived from this region 
was further used as a probe to screen a ZAPEX-GH3 cDNA 
library* This cDNA library was constructed in ZAP-Express 

20 phage (Stratagene^ San Diego, CA) , using oligo dT priming. 
Plate hybridization and other cloning technicjues were 
performed according to standard procedures (Sambrook et al-, 
1989, Molecular cloning: A Laboratory Manual, 2nd Edition 
(Cold Spring Harbor, NY: Cold Spring Harbor Laboratory)). 

25 Clone ZX5 had a 2.5 kb insert that contained in addition to 
the B102 fragment, a sequence downstream that matched 
additional peptide secjuence. A second cDNA library was made 
from GH3 mRMA by priming with a specific oligonucleotide GGA 
GGT CTC CTT TAG according to the sequence that was found in 

30 the 5* end of clone ZX5. This cDNA was cloned into ZAP- 
Express (Stratagene, San Diego, CA) to generate ZB-GH3 
library. This library was use to isolate multiple clones 
that overlapped with ZX5 and contained the 5* end of the gene. 
To clone the human gene a cDNA library was made from IMR32 

35 neuroblastoma cells in ZAP-Express (ZX-IMR) . Probes were 

generated by PCR from the 5» ends of rat clone ZB181 and from 
XHR32 cDKA according to the B102 sequence as described above 
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for the rat gene. Several clones had a 5 kb insert that 
contained the full length gene. DNA sequence determination 
was carried out using the dideoxy-chain termination method 
with Sequenase 2.0 (United States Biochemical Corporation, 
5 Cleveland, OH) . Sequencing was performed on both strands by 
priming with synthetic oligonucleotides. 

8.1.3. P?TrffPP^-**° Construots 

An EcoRI-XhoI fragment containing the 5 • end of rat 
10 CASPR/P190 (from clone ZB161) was ligated with an XhoI-EcoRI 
fragment containing the 3' end of the gene (from clone ZB181) 
and cloned into pCMPl (Peles et al., 1995, Cell, 82:251-260) 
to generate pCM190R. An HA-tagged version of the gene was 
constructed by replacing an EcoRI-AccI fragment with a PCR- 
15 generated fragment containing the HA-tag secpience. This 

resulted in the addition of the HA sequence to the 3« end of 
the coding region of rat CASPR/pl90 to generate pCM190HA. 
Construction of contactin expression vectors was previously 
described (Peles et al. , 1995, Cell, 82:251-260). The 
20 plasmids pSGT-cSRC and pSGT-fyn, containing human src and fyn 
genes and the plasmids used for generation of the 6ST-SH3S 
fusions were described previously (Erplel et al. , 1995, EMBO 
J., 14:963-975). To generate a GST-fusion protein containing 
the cytoplasmic tail of rat CASPR/pl90, the corresponding 
25 region (aa 1308-1380) was amplified by PCR and cloned into 
PGEX-4T (Pharmacia) . The sequence of the final construct was 
verified by DNA sequencing. 

8.1.4. Morthem Bj^f^f^ Analysis 



30 Multiple tissue northern blots (HTN Blots, Clontech) 

were Used. A DNA fragment (position 3600-4232 of human 
CASPR/P190) was generated by RT-PCR from IMR32 mRNA. This 
fragment was labeled by random priming ("prime it"; 
Stratagene, San Diego, CA) , purified using PCR-clean column 

35 (Qiagen) and used as a probe. Hybridization was carried out 
for 16 hours in a buffer containing 5X SSC, 5X Denhart's 
solution, 50% formamide, 0.2% SDS and 100 ug/ml denatured 
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salmon sperm DNA at 42oC. The blots were washed at 60oC 
twice in a buffer containing 0.5X SSC, 0.1% SDS and once with 
O.IX SSC, 0.2% SDS. Signals were detected by 
autoradiography. The same membranes were reprobed with a 2 
5 kb human /3-actin cDNA as a control probe (Clontech, Palo 
Alto, CA) . 

ffHiinii^ft^ istoohemlgtrv and Ia«>Fusion Binding 
Production of different Ig-fusion chimeric proteins and 
10 cell binding experiments were done exactly as described 

previously (Peles et al. , 1995, Cell, 82:251-260). Staining 
of tissue sections with antibodies was done essentially as 
described (Milev et al., 1994, J. Cell Biol., 127:1703-1715). 



X5 generati on cf Antibodies 

Polyclonal antibodies against CASPR/pl90 were generated 
according to standard procedures (Harlow, 1990, Antibodies: A 
laboratory Manual) . Ab60 was obtained by immunizing rabbits 
with a GST-fusion protein containing all the cytoplasmic 

20 domain of rat CASPR/pl90 (GST-190CT) . Affinity purification 
was achieved first by passing the serum on a column of 
Sepharose-GST . Then, the unbound material was loaded on a 
column of GST-190CT Sepharose. Bound antibodies were eluted 
with 100 mM sodixun citrate pH 2.8 and 1.5 M MgCl,. Eluted 

25 material was precipitated with ammonium sulfate, resuspended 
in DDW and extensively dialysis against PBS. Antibody 87AP 
was generated against an eight aa long peptide corresponding 
to the C-terminal sequence of rat CASPR/pl90. Affinity 
purification on a Sepharose-peptide coltimn was done 

3 0 essentially as described above for Ab60. Antibodies against 
F3 were previously described (Faivre-Sarrailh et al., 1992, 
J. Neurosci., 12:257-267). Antibody CSTl that recognize Src, 
Fyn and Yes was previously described (Erplel et al., 1995, 
EHBO J., 14:963-975). Abl8 against Src and Abl6 against Fyn 

35 were purchased from Santa Cruz Antibodies (Santa Cruz, CA) . 
Monoclonal antibody against HA-tag was purchase from 
Boehringer. House polyclonal antibody against contactin was 
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generated by immunization of mice with purified human 
contactin-lg fusion protein according to Yoshihara et al, 
1994, Neuron, 13:415-426. 

5 8.1.7. aanarati on of Antl HCon-Ia Sera 

Immunoprecipitation and Western blot analysis: COS 
transfection protocol using Lipofectamine (Gibco-BRL) was 
previously described (Peles et al., 1995, Cell, 82:251-260). 
To detect the association between Contactin and CASPR/pi90 
10 the cells were grown to subconf luency and were metabolically 
labeled with lOOmCi/ml [''S) -methionine and cysteine mix (NEN, 
Boston, HA) for four hours at 37»C. Membranes were prepared 
from the cells and further solubilized in SML buffer (2% 
sodium monolaurate, 2 mM MgCl', 2 mM PMSF in PBS) . /Sc-Fc 
15 bound to Sepharose-protein A beads was added to a tenfold 
diluted supernatant and incubated for two hours at 4«C. The 
beads were washed twice with 0.15% sodium monolaurate in PBS 
and once in PBS before the addition of SDS sample buffer. 
The precipitated proteins were separated on 7.5% gel and 
20 subjected to autoradiography. 

Preparation of rat brain membranes: five P7 rat brains 
were pooled and homogenized in a glass homogenizer in a 
buffer containing 20 mM Hepes pH 7.4, 0.32 M sucrose, 1 mM 
EGTA, 1.5 mM HgS04, 10 /ig/ml Aprotinin and Leupeptin and 1 mM 
25 PMSF. Nuclei and heavy cell debris were removed by low speed 
centrifugation (3000g x lO minutes at 4''C) , and the 
supernatant was then subjected to high speed centrifugation 
at 40,000g for 60 minutes. The membrane pellet was 
resuspended in SML solubilization buffer. After one hour 
30 incubation on ice the detergent-insoluble materials was 
removed by centrifugation. The sample was diluted four to 
tenfold with PBS containing 2 mM MgCl, and subjected to 
precipitation with antibodies or Ig-fusions. 

Biotinylation of cell surface molecules was carried out 
35 for 20 minutes at 23«c using 50 fig/ml Biotin-I^-NHS (Pierce) . 
The reaction was stopped by adding NH4CI to final 
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concentration of 10 xnM followed by two washes with TBS- 
glycine buffer (50 mM Tris pH 7.4, 150 mM NaCl and 50 mM 
glycine) on ice prior to solubilization. 

Immunoprecipitation and western blotting was performed 
5 as described previously (Peles et al, 1992, Cell, 69:205- 
216) . Blots were reacted with streptavidin- linked peroxidase 
(Amersham) and detected using chemi luminescence reagent 
(Pierce) . 

10 8.2. RESUIiTB 

CASPR/Pl^O Gene and Qene products 

The 190 kD protein which associates with the CAH- 
contactin complex was pxirified using affinity chromatography 
with PC-Pc, utilizing the techniques described, above, in 
15 section 8.1. Briefly, membrane lysates from GH3 cells were 
applied to a /Jc-Fc column and bound proteins were separated 
by SDS-PAGE. The protein believed to correspond to pl90 was 
excised and subjected to trypsin digestion. The amino acid 
sequences of two tryptic peptides were detexnnined using a 
20 gas-phase microsequencer • The amino acid sequences obtained 
were then utilized to identify corresponding DNA fragments 
encoding such sequences, as described, above, in Section 8.1. 
The DNA fragments thus obtained were in turn used to isolate 
cDNA molecules encoding the full length pl90 gene products of 
25 both human and rat. 

The human CASPR/pl90 nucleic acid sequence is depicted 
in SEQ ID NO:l, and the human CASPR/pl90 amino acid sequence 
is depicted in SEQ ID NO: 2. The rat CASPR/pl90 nucleic acid 
sequence is depicted in SEQ ID NO: 3, and the rat CASPR/pl90 
30 amino acid sequence is depicted in SEQ ID NO: 4. 

The htunan and rat CASPR/pl90 transcripts have open 
reading frames that encode for 1384 and 1381 amino acids, 
respectively, and share 93% identity at the amino acid level. 
CASPR/pl90 is a putative type I transmembrane protein with a 
35 short proline-rich. cytoplasmic domain. (The transmembrane 
domain is marked as TMD in Figure 1) . 
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The first pl90 methionine is followed by a stretch of 
19-20 amino acid residues rich in hydrophobic residues, which 
probably acts as a signal secjuence. The extracellular 
domains of rat and human CASPR/pl90 contain 1281 and 1282 
5 amino acid residues, respectively. The extracellular region 
of CASPR/P190 contains 16 potential N-linked glycosylation 
sites followed by a second hydrophobic stretch that is a 
typical transmembrane domain. 

The CASPR/pl90 extracellular domain is a mosaic of 
10 several motifs known to mediate protein-protein interactions. 
Near the N-terminus of mature CASPR/pl90 (109 amino acid 
residues) is a domain with 31-33% amino acid identity to the 
CI and C2 terminal domains of coagulation factors V and VIII 
(Jenny et al., 1987, Proc. Natl. Acad. Sci. U.S.A., 84:4846- 
15 50; Wood et al., 1984, Nature, 312:330-37) and 26% identity 
with the neuronal adhesion molecule neurophilin (previously 
known as the neuronal A5 antigen) and 20% identity to a 
region of discoidin I, a lectin from the slime mold 
Dictyostelium discoideum (Takagi et al., 1991, Neuron, 7:295- 
20 307) . The domain is marked as DISC in Figure 1. The 
extracellular domain of CASPR/pl90 also contains foxir 
repeats, of approximately 140 amino acid residues each, with 
homology to neurexins, a family of polymorphic neuronal cell 
surface proteins. These domains are marked as NX1-NX4 in 
25 Figure 1. There are 6 copies of the motif in the o- 

neurexins, one in the /S-neurexins , and one to five in the C- 
terminal portions of laminin A, agrin, slit, and perlecan 
(Ushkaryov et al., 1992, Science, 257:50-56). Together, the 
five motifs in the basement membrane protein laminin A are 
30 referred to as the G domain, a region suggested to mediate 
cell adhesion. The first three neurexin motifs of CASPR/pl90 
share 29-32% amino acid identity to regions of rat 
neurexinlll-a and neurexinll-a, whereas the fourth motif is 
most similar to agrin (34% identity) . CASPR/pl90 also 
35 contains two epidermal growth factor (EGF)-like modules 
(marked as EGF1-EGF2 in Figure 1) ; both of which are most 
related to repeats within the drosophila neurogenic proteins 
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Notch and slit (39-46% identity) (Rothberg et al. , 1988, 
Cell, 55:1047-59; Wharton et al., 1985, Cell, 43:567-81). A 
single domain related to the C-tenninal region of fibrinogen 
beta/gamma (marlced as FIB in Figure 1) is flanked by an EGF 
5 and neurexin motif. Finally, there is a stretch of 47 amino 
acids, that is identical between human and rat CASPR/pl90, 
and contains seven copies of Pro-Gly-Tyr-X,.2 and three 
additional imperfect repeats of this sequence (marked as PGY 
in Figure 1) . The Pro-Gly-Tyr repeat is found in a molluscan 

10 adhesive protein (SW:A61077, and a putative chicken prior 
protein (SW:A46280) , whereas the Pro-X-Tyr repeat is present 
in multiple copies in a soybean cell wall protein (SW:A29324) 
and the X-Gly-Tyr repeat in heterogeneous nuclear RNP 
proteins (SW:B41732) . The cytoplasmic domain of human and 

15 rat CASPR/pl90 contain 78 and 74 amino acids, respectively. 
These include a 38-42 amino acid proline-rich motif (38% 
proline) , the majority of which consists of proline residues 
alternating with alanine, glycine, or threonine residues 
(marked as PRO in Figure 1) . Alignment of this region with 

20 the non-redundant protein database revealed several proteins 
containing such "PAPA« motifs. Proline-rich domains can 
serve as binding sites for SH3 -containing protein, yet none 
of the proteins that align with this domain of CASPR/pl90 are 
known to interact with an SH3 motif. 

25 

8.2.2. CAflPR/pl oo Bgpression 
Norl:hern blot analysis of mRNA isolated from human 
tissues reveals that CASPR/pl90 was expressed predominantly 
in the brain as a 6.2 kb transcript. Weak expression of 
30 CASPR/plSO was detected in ovary, as well as in the pancreas, 
colon, lung, heart, intestine and testis. Similar results 
wee obtained for rat tissue hybridized with a rat CASPR/pl90 
probe • 

A high level of CASPR/pl90 was detected in different 
35 regions of the adult human nervous system, including high 
expression in the cortex, cerebellum and in the thalamus, 
while weaker expression is detected in the spinal cord and in 
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the corpus callosuia* 
CASPR/pl90 gene was e 



analyses demonstrated that the 
predominantly in the central 



Polyclonal rabbit antibodies raised against a GST fusion 
5 protein containing the CASPR/pl90 cytoplasmic domain were 
raised and used to stain permeabilized human IMR-32 
neuroblastoma and rat GH3 neuroendocrine cell lines found to 
express CASPR/pl90. These studies revealed recognition of a 
190 kD protein. 

10 similar results were obtained staining COS7 cell lysates 

that had been transf ected with an expression vector directing 
the synthesis of CASPR/pl90. No CASPR/pl90 was detected in 
mock-transf ected or untransf ected cells. 

immunohistochemistry studies were then performed which 

15 demonstrated that CASPR/pl90 and contactin localized in the 
rat retina. Specific caiSPR/pl90 staining was seen in the 
ganglion cell fiber layer and in the inner plexiform layers. 
Similar staining was observed for contactin, with the highest 
expression in the nerve fiber layer containing the axons that 

20 project from the ganglion cells into the optic nerve. Thus, 
CASPR/P190 and contactin colocalize on neurons in fiber-rich 
areas of the retina. Further, increased CASPR/pl90 staining 
was detected in membrane preparations from rat brains from 
E18 to post-natal day eight, a period of extensive axonal 
25 outgrowth and synaptogenesis. A similar temporal expression 
pattern was detected in this tissue in the same period 
(Gennarini et al. , 1989, J. Cell Biol. 109:755-788). 



30 




The interaction between contactin, RPTP^ and CASPR/pl90 
was then investigated using soluble and membrane-associated 
variants of these proteins. Specifically, the possibility 
that the interaction between contactin and CASPR/pl90 
requires that both proteins be present on the same cell (cis 
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To examine this possibility, COS7 cells were transfected 
with expression vectors that expressed either CASPR/pl90 
alone or together with contactin. Lysates of transfected 
cells were subject to precipitation analysis with the CAH 
5 domain of RPTP/S (/Jc-Pc) . The CAH domain of RPTP^ only 

precipitated CASPR/pl90 from cells co-expressing contactin. 
Thus, it appears that the CAH domain of RPTPjS can form a 
ternary complex with contactin and CASPR/pl90 proteins • 
Similar results were obtained using an expression vector 
10 expressing tagged CASPR/pl90. 

Moreover, soluble contactin molecules did not associate 
with CASPR/P190 when RPTP/S and CASPR/pl90 were co-expressed 
in the saone cells. 

On the basis of these experiments, it appears that the 
15 CAH domain of RPTP<5 does not bind directly to CASPR/pl90 and 
that contactin and CASPR/pl90 are complexed by means of 
lateral interactions (cis) in the membrane, thus explaining 
the reason why the protein is referred to as CASPR (i*e> , 
Contactin-associated protein) . 

20 

8.2.4. complex Formation Batvaa n GASPR/PX90 and Contaptiii 

The role of RPTP/S in formation of the CASPR/pl90- 
contactin complex was next examined. IMR-32 cell lysates 
were subjected to immunoprecipitation with CASPR/pl90 

25 antibodies followed by immunoblotting with contactin 

antibodies. These experiments demonstrated that contactin 
and CASPR/pl90 were const itutively associated on the surface 
of the IMR-32 cells. In this cell line, it appeared that 
virtually all CASPR/pl90 molecules were associated with 

30 contactin. 

The existence of an in vivo contact in-CASPR/pl90 complex 
was also demonstrated using rat brain tissue. Lysates of P7 
rat brain membranes were subjected to precipitation with ^C- 
Fc followed by immunoblotting with antibodies specific to 
35 either contactin of CASPR/pl90. 

Taken together, these data demonstrate that contactin 
and CASPR/P190 are constitutive ly complexed in neuronal cell 
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lines and 
two proteins 



and that complex 
does not require RPTP^ 



8.2.5. mtoraotion Between CA6PR/pl90 and SH3 Domains of 

flianalina Molecules . . . 

Experiments described herein demonstrated that the 

CASPR/pl90 cytoplasmic domain can serve as a binding site for 

SH3 domains of signalling molecules which will transmit the 

signal initiated by RPTP/S binding to the eontactin-CASPR/pl90 



10 



20 



30 



Specifically, four of seven GST-SH3 domains of 
signalling molecules were able to bind selectively to the 
CASPR/plSO protein, including the SH3 domains of Src, Fyn, 
p85 and PLCy. Association was not detected with Csk, Grb2 or 
Gap SH3 domains. CASPR/pl90 did not bind to a mutant Src SH3 
domain in which a conserved Trp at position 118 was replaced 

with an Ala residue. 

Next it was determined that c-Src could associate with 
CASPR/P190 fusion proteins in transiently infected OOS7 
cells. Specifically, lysates of transfected cells were 
subjected to immunoprecipitation with antibodies against c- 
Src, followed by immunoblotting with anti-fusion antibodies. 

Further, the association between endogenous c-Src and 
CASPR/P190 in IMR-32 or GH3 cells was investigated using a 
similar immunoprecipitation/ immunoblotting strategy with Src 
and CASPR/P190 antibodies. Results of such experiments 
detected no association between c-Src and CASPR/pl90. 

These experiments raise the possibility that the 
cytoplasmic domain of CASPR/pl90 can serve as a target for 
SH3 domains of signalling molecules. 
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(A) LENGTHS 5294 base pain 
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(xl) SEQUENCE DESCRIPTIONS SBQ ID NOsls 
CAAGAGCGGA GGACCAGGAA CCAGAGA6AG AGAGAGAGAA AA6AGAGAGG AGAGACAGA6 60 

20 

CGCTTGGGGG CGAAAGGAGA GAGGGAGG6A AGGGTGGGTA AGGAGGAGAG AGCGGTCT6C 120 
TGCAAACCCC AGGAGGAGAG CTTGGAGCCC AAGCCAGAAC TOGAGCCCTA GCCGGAGCOG 180 



ATG ATG CAT CTC CGG CTC 235 
25 Met Met Hla Leu Arg Leu 

1 5 

TTC TOC ATC CTG CTC GCC GOG GTC TCA GOA GCC GAG GGC TGG OGC TAC 283 

Phe Cya He Leu Leu Ala Ala Val Ser Gly Ala Glu Gly Trp Gly Tyr 

10 15 20 

30 

TAC GGC TGC GAC GAG GAG CTG GTG GOT CCC CTG TAT GCA CGC TCC CTG 331 

Tyr Gly Cye Asp Glu Glu Leu Val Gly Pro Leu Tyr Ala Arg Ser Leu 

30 35 



GGC GCC TCC TCC TAC TAC AGT CTC CTT ACT GCG CCG CGA TTC GCC AGG 379 
35 Gly Ala Ser Ser Tyr Tyr Ser Leu Leu Thr Ala Pro Arg Phe Ala Arg 

40 45 50 
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CTG CAC CCC ATA ACC GGO TGO TCA CCA CGG ATT OGG GAT COG AAT CCC 427 

Z«eu Hie Gly lie Sor Gly Trp Ser Pro Arg He Gly Asp Pro Asn Pro 
55 60 65 70 

TGG CTG CA6 ATA GAG TTA ATG AAG AAG CAC CGG ATC CGG GGC GTG 6CC 475 
5 Trp Leu Gin He Asp Leu Met Lys Lys Hie Arg He Arg Ala Val Ala 

75 80 85 

ACA CAG GGC TCC TTT AAT TCT TGG GAC TGG GTG ACA CGT TAG ATO CTA 523 

Thr Gin Gly Ser Phe Aen Ser Trp Aep Trp Val Thr Arg Tyr Met Leu 

90 95 100 

10 

CTG TAG GGC GAG CGA GTG GAC AGC TGG ACA CGG TTG TAG CAG CGA GGG 571 
Leu Tyr Gly Aep Arg Val Aep Ser Trp Thr Pro Phe Tyr Gin Arg Oly 
105 110 115 

CAG AAG TGG ACC TTG TTT GGT AAG GTG AAG CAG TOO GOG GTG CTG CCC 619 
15 Hie Aen Ser Thr Phe Phe Gly Aen Val Aen Clu Ser Ala Val Val Arg 
120 125 130 

GAT CAG CTG CAC TTG CAC TTG ACT GOG CCC TAG ATG GGC ATC CTG CGG 667 

Hie Aep Leu Hie Phe Hie Phe Thr Ala Arg Tyr He Arg He Val Pro 
135 140 145 150 

20 

CTG CCC TGG AAG CCA CGG CGG AAG ATG CCC CTG AGC CTG CCC CTC TAT 715 
Leu Ala Trp Aen Pro Arg Gly Lye He Gly Leu Arg I*eu Gly Leu Tyr 

155 160 



GGG TCC CCA TAG AAC CCC CAG ATA CTC TAT TTG GAG GGC CAC CAT CCC 763 
25 Gly eye Pro Tyr Lye Ala Aep He Leu Tyr Phe Aep Gly Aep Aep Ala 

170 175 180 

ATG TCC TAG CGG TTG COG OGA CCC CTC ACC OCA ACC CTG TCC CAG CTG 811 
He Ser Tyr Arg Phe Pro Arg Gly Val Ser Arg Ser Leu Trp Aep Val 
185 190 195 

30 

TTG CCC TTG AGC TTG AAG ACC GAC GAG AAG GAC GGT CTT CTG CTC GAG 859 

Phe Ala Phe Ser Phe Lye Thr Clu Glu Lye Aep Gly Leu Leu I#eu Hie 
200 205 210 

CCC CAG GGC CCC CAC CGG GAG TAG GTG ACC CTC GAG CTG CAC GGG CCA 907 

35 Ala Clu Gly Ala Gin Gly Aep Tyr Val Thr I*eu Glu Leu Clu Gly Ala 
215 220 225 230 
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cue CTG CTG CT6 CAC ATG AOC 
HiB I»eu Leu Lieu Hie Met Ser 

235 



GGC AGO AGC 
Gly Ser Ser 
240 



ATC CAG CCA AGA 
He Gin Pro Arg 
245 



CCA OCT CAC ACC ACC GTG AGC GCA GGC OGA GTC CTC AAT GAC CAG CAC 
5 Pro Gly Hie Thr Thr Val Ser Ala Gly Gly Val Leu Aen Aep Gin Hie 

250 255 260 



1003 



CAC TAT CTC OGG GTG GAC C6A TTT GGC OGC CAT OTA AAT TTC ACC 

HiB Tyr Val Arg Val Aep Arg Phe Gly Arg Aep Val Aen Phe Thr 

270 275 



10 



GAC GGC TAT GTG CAG OGC TTT ATT CTC AAT GGA GAC TTC GAG AGG 

Aep Gly Tyr Val Gin Arg Phe He Leu Aen Gly Aep Phe Glu Arg 
280 285 290 



1051 



1099 



CTG AXC CTG GAC ACT GAG ATG TTC ATC GGA CGT CTG GTG GGC GCC GOG 
X5 Leu Aen Leu Aep Thr Glu Meti Phe Xle Gly Gly Leu Val Gly Ala Ala 
295 300 305 310 



1147 



COG AAG AAC CTG GCC TAT OGG CAT AAC TTC OGC GGC TOO ATA GAA AAC 

Arg Lye Aen Leu Ala Tyr Arg Hie Aen Phe Arg Gly Cye He Glu Aen 

320 



20 



GTA ATC TTC AAC OGC GTC AAC ATC CCA GAC CTG GCC GTG OGG OGC CAT 

Val He Phe Aen Arg Val Aen He Ala Aep Leu Ala Val Arg Arg Hie 

330 335 340 



TCC OGG ATC ACC TTC GAG GGT AAG CTG GCT TTT CGT TGC CTG GAC CCG 
25 ser Arg He Thr Phe Glu Gly Lye Val Ala Phe Arg Cye Leu Aep 

350 355 



1195 



1243 



1291 



30 



GTA CCG CAC 
Val Pro Hie 
360 

CCC GGT TTC 

Pro Gly Phe 

375 



ATC AAC TTC GGA OGC OCT CAC AAC TTC GTT CAA GTG 

He Aen Phe Gly Gly Pro Hie Aen Phe Val Gin Val 

370 



1339 



OGC OCT GGC OGC CTG OCA GTC TCA TTT OGC TTC OGC 

Arg Arg Gly Arg Leu Ala Val Ser Phe Arg Phe Arg 
380 385 390 



ACC TOG GAC CTC ACC OGG CTT CTC CTT TTC TCC CGT CTG GOG GAC GOG 
35 Thr Trp Aep Leu Thr Gly I*eu Leu Leu Phe Ser Arg Leu Gly Aep Gly 

400 405 



1387 



1435 
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CTG GGC CAC GTO GAG 
I^u Gly Hi.8 Val Glu 

410 



AOG CTC AGC 6AA GGG CAG GTC AAC QTG 
Thr Leu Ser Glu Gly Gin Val Aan Val 
415 420 



1483 



ATC GOG CAG AGC GGC 

5 He Ala Gin Ser Gly 

425 



AAG AAG CTT CAG TTC GCT OCT GGG TAC 
LyB Lye Leu Gin Phe Ala Ala Gly Tyr 
430 



1531 



10 



CTG AAT GAC GGC TTT TGG CAC GAG GTG AAT TTT GTG GCA CAG OAA AAC 

Leu Aen Asp Gly Phe Trp Hie Glu Val Aan Phe Val Ala Gin Glu Asn 
440 445 450 

CAT GCA GTT ATC AGC ATT GAT GAT GTG GAA GGG OCA OAG GTC AOG OTC 
Hie Ala Val lie Ser lie Asp Asp Val Glu Gly Ala Glu Val Arg Val 

460 465 470 



1579 



1627 



TCA TAC CCG TTG CTG ATC CGG ACA GGG ACC TCA TAT TTC TTT GGG GOT 
15 ser Tyr Pro Leu Leu He Arg Thr Gly Thr Ser Tyr Phe Phe Gly Gly 

475 480 485 



1675 



20 



TOG GAC TGC CAC TCC AAC CAG ACG GCA 

Trp Asp Cye His Ser Asn Gin Thr Ala 
495 500 

CTC AAG GTG GAT GOT CAA CTG GTC AAC 
Leu Lys Val Asp Gly Gin Leu Val Asn 
510 



TGT CCC AAG CCA GCC AGT 
Cye Pro Lys Pro Ala Ser 

490 

TTC CAT GGC TGC ATG GAG 
Phe His Gly Cys Met: Glu 
S05 



CTG ACT CTG GTG OAG GGC CGG CGG CTT OGA TTC TAT GCT GAG GTC CTC 
25 Leu Thr Leu Val Glu Gly Arg Arg Leu Gly Phe Tyr Ala Glu Val Leu 
520 525 530 



1723 



TTT GAT ACA TGT GGC ATC ACT GAT AGO TGC AGC 
Phe Asp Thr Cys Gly He Thr Asp Arg Cys Ser 

540 



AAC ATG TGT OAG 
Asn Met Cys Glu 



30 



CAT CAT OGA OGC TGC TAC CAG TCT TGG OAT GAC 
His Asp Gly Arg Cys Tyr Gin Ser Trp Asp Asp 

555 560 



ATT TGC TAC TGC 
He Cys Tyr Cys 



1771 



1819 



1867 



1915 



GAA CTG ACG GGC TAC AAG GCA GAG ACC TGC CAC ACA CCT TTG TAT AAG 
35 Glu Leu Thr Gly Tyr Lys Gly Glu Thr Cys His Thr Pro Leu Tyr Lys 

570 575 580 



1963 
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GAA TCC TGT GAG OCT TAT CGG CTC AGT GOG AAA ACT TCT GGA AAC TTC 
Olu Ser Cya Glu Ala Tyr Arg I^u Sor Gly bye Thr Ser Gly Asn Phe 

590 



ACC ATT GAT CCT GAT GGC AGT GGC CCC CTG AAG CCA TTT GTA GTG TAC 
5 Thr He Aep Pro Asp Gly Ser Gly Pro I*eu Lye Pro Phe Val Val Tyr 
600 605 610 

IGT GAT ATC OGA GAG AAC GGA GCG TGG ACA GTT GTG CGG CAT OAC AGG 
Cyo Asp He Arg Glu Asn Arg Ala Trp Thr Val Val Arg His Asp Arg 

620 



625 



630 



10 



CTG TGG ACA ACT OGA GTG ACA GGT TCC AGC ATG GAG CGG CCA TTC CTG 

Leu Trp Thr Thr Arg Val Thr Gly Ser Ser Met Glu Arg Pro Phe Leu 

640 



645 



GGG GCT ATC CAG TAC TGG AAT GCA TCC TGG GAG GAA GTC AGT GCC CTT 
Gly Ala He Gin Tyr Trp Asn Ala Ser Trp Glu Glu Val Ser Ala Leu 



650 



660 



2011 



2059 



2107 



2155 



2203 



GCC AAT GCT TCC CAG CAT TGT GAA CAG TGG ATC GAG TTC 

Ala Asn Ala Ser Gin His Cys Glu Gin Trp He Glu Phe 

670 675 



TGC TAC 

Cys Tyr 



20 



AAT TCC CGG CTG CTC AAC ACT GCA GGA GGC TAC CCC TAC 

Asn ser Arg Leu Leu Asn Thr Ala Gly Gly Tyr Pro Tyr 

690 



TTT TOG 

Phe Trp 



680 



ATT GGC OGA AAT GAG GAG CAG CAC TTC TAC TGG GGA GGC TCC CAG CCT 
25 lie Gly Arg Asn Glu Glu Gin His Phe Tyr Trp Gly Gly Ser Gin Pro 

700 705 710 

AGC TGT GTG GAC CCT 
Ser Cys Val Asp Pro 

725 

CAG TGG AGA ACT GAC 
Gin Trp Arg Thr Asp 
740 



30 



GGG ATC CAG OGC TGT GCC TGT GGT CTG GAC 
Gly He Gin Arg Cys Ala Cys Gly Leu Asp 

715 720 

GCC TTG TAC TGC AAC TGT CAC GCT GAC CAG 
Ala I*eu Tyr Cys Asn Cys Asp Ala Asp Gin 

730 735 



2347 



2443 
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A2VG CGA CTG CTG ACC TTT OTG GAC CAT CTG 

Lys Oly Leu Leu Thr Phe Val Asp Hio Leu 
745 750 



GTC ACT CAG GTA OTG 
Val Thr Gin Val Val 

755 



2491 



ATA CGG GAT ACG AAC CCC TCC ACT TCT GAG GCC CAG TTC TTC CTG AGO 

5 He Gly Aep Thr Aan Arg Ser Thr Ser Glu Ala Gin Phe Phe Leu Arg 
760 765 770 



10 



CCT CTG OGC TGC TAT GGC GAT 

Pro Leu Arg Cye Tyr Gly Asp 

775 780 

CAC ACC GGG GCT GCA CTA OGC 
Hie Thr Gly Ala Ala Ircu Arg 

795 



AAT TCC TGG AAC ACC ATT TCC TTC 
Asn Ser Trp Aan Thr lie Ser Phe 
785 790 

CCC CCA ATC CGT GCC AAC CAC AGC 
Pro Pro He Arg Ala Aen His Ser 
800 805 



2539 



2587 



2635 



CTG GAT GTC TCC TTC TAC TTC AGG ACC TCT GCT CCC TOG GGG GTC TTC 
15 Leu Asp Val Ser Phe Tyr Phe Arg Thr Ser Ala Pro Ser Gly Val Phe 

810 815 820 



2683 



GAG AAT ATG GGG GGC CCT TAC TOO CAG TGG OGC OGA CCT TAT CTG 

Glu Asn Met Gly Gly Pro Tyr Cys Gin Trp Arg Arg Pro Tyr Val 

830 



20 



GTG GAA CTC AAC ACA TCC OCG GAT GTG GTC TTC GCC TTT GAT GTG 
Val Glu Leu Asn Thr Ser Arg Asp Val Val Phe Ala Phe Asp Val 
840 845 850 



GGG AAT GGG GAT GAG AAC CTC ACA GTA CAC TCA GAC GAC TTT GAG TTC 
25 Gly Asn Gly Asp Glu Asn Leu Thr Val His Ser Asp Asp Phe Glu Phe 

860 865 870 



2731 



2779 



2827 



30 



AAT OAT GAC GAG TGG CAC CTG GTC CGG GCT GAA ATC AAC GTG AAG CAG 
Asn Asp Asp Glu Trp His Leu Val Arg Ala Glu He Asn Val Lys Oln 

875 880 885 

GCC CGG CTC OGA CTG OAT CAC CGG CCC TGG GTT CTG OGG CCT ATG OCA 
Ala Arg Leu Arg Val Asp His Arg Pro Trp Val Leu Arg Pro Met Pro 

890 895 900 



2875 



2923 



CTG CAG ACC TAC ATC TGG ATG CAG TAT GAC CAG CCC CTC TAT GTG OGA 

35 Leu Gin Thr Tyr He Trp Met Glu Tyr Asp Gin Pro Leu Tyr Val Gly 

905 910 915 



2971 
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TCT OCA GAG 
Ser Ala Glu 
920 



AAG AGA 06C 

Lys Arg Arg 
925 



TTT GTG GOT TCC TTG AGO OCC ATG 
Phe Val Gly Cye Leu Arg Ala Met 

930 



3019 



CGT CTG AAC OCA GTG ACT CTO AAC CTG GAG GGC CGT GCC AAT OCC TCT 
5 Arg I*eu Aen Gly Val Thr I#eu Aen Leu Glu Oly Arg Ala Aon Ala Ser 
935 940 945 950 

GAG OCT ACC TCA CCC AAC TOC ACA GGC CAC TOT GCC CAC CCT COG CTC 
Glu Oly Thr Ser Pro Aen Cys Thr Gly Hie Cye Ala Hie Pro Arg Leu 

955 960 965 

10 

CCC TOT TTC CAT OGA GGC C6C TOC GTG GAG COC TAT AOC TAC TAC AOG 

Pro eye Phe Hie Gly Gly Arg Cye Val Glu Arg Tyr Ser Tyr Tyr Thr 

970 975 9B0 

TOT OAC TOT OAC CTC ACO OCT TTT OAT GOG CCA TAC TOC AAC CAC OAT 

15 cya Aep Cye Aep Leu Thr Ala Phe Asp Gly Pro Tyr Cye Aen Hie Asp 

965 990 995 

ATT GOT COT TTC TTT GAG COG OCC ACC TOG ATO OCC TAT AAC CTA CAG 

lie Oly Oly Phe Phe Olu Pro Oly Thr Trp Met Arg Tyr Asn Leu Gin 
1000 1005 1010 

20 

TCA GCG CTG COC TCT OCA OCC AGO GAG TTC TCC CAC ATO CTG AGC COO 
ser Ala Leu Arg Ser Ala Ala Arg Olu Phe Ser His Met Leu Ser Arg 
1015 1020 1025 1030 

CCA GTG CCA GGC TAT OAO CCT GGC TAC ATC CCG GGC TAT OAT ACT COG 

25 Pro Val Pro Oly Tyr Olu Pro Oly Tyr lie Pro Oly Tyr Asp Thr Pro 

1035 1040 1045 

OCC TAT CTO CCT GGC TAC CAT GGC CCC COO TAC COC CTG CCC OAC TAC 

Oly Tyr Val Pro Gly Tyr His Gly Pro Gly Tyr Arg Leu Pro Asp Tyr 

1050 1055 



1060 



30 



CCC COG CCT GOT CGO CCT GTG CCC GOT TAC COT GOG CCT OTC TAC AAC 
Pro Arg Pro Oly Arg Pro Val Pro Gly Tyr Arg Oly Pro Val Tyr Asn 

1070 1075 



1065 



OTT ACO OGA GAG GAG OTC TCC TTC AGC TTC AOC ACC AGC TCC GCC CCT 
35 Val Thr Gly Glu Glu Val Ser Phe Ser Phe Ser Thr Ser ser Ala Pro 
1080 1085 1090 



3067 



3115 



3163 



3211 



3307 



3355 



3403 



3451 



3499 
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GCT GTC CTG CTC TAC GTC AGT 
Ala Val Leu I#eu Tyr Val Ser 
1095 1100 



TTT GTT CGT GAC TAC ATG GCT GTG 
Phe Val Arg Asp Tyr Met Ala Val 
1105 1110 



CTC ATC AAG GAT GAT GGG ACC CTT CAG CTG CGA TAT CAG CTG GGC ACC 
5 i^u lie Lyo Aep Asp Gly Thr Leu Gin Leu Arg Tyr Gin Leu Gly Thr 

1115 1120 1125 



3547 



3595 



10 



AGT CCC TAC GTG TAC CAG CTA ACC ACT 

Tyr Val Tyr Gin Leu Thr Thr 
1130 1135 



CCC CAT AGC ATC AAT ATC ACC CGT GTT TAC 
Pro HlB ser lie Aen lie Thr Arg Val Tyr 
1145 1150 



GTG ACC GAT GGC CAG 
Val Thr Aep Gly Gin 
1140 

AAC CTC TTC ATC GAG 
Aen Leu Phe He Gin 



GTG GAC TAC TTC CCA CTG ACA GAG CAG AAG TTC TCG CTG TTG GTG GAC 
X5 val Asp Tyr Phe Pro Leu Thr Clu Gin Lye Phe Ser Leu Leu Val Asp 
1160 1165 1170 



AGC CAG TTG GAC TCA CCC AAG GCC TTG TAT TTA GGG CGT GTG ATG GAG 
Ser Gin Leu Asp Ser Pro Lye Ala Leu Tyr Leu Gly Arg Val Met Glu 
1175 1180 1185 1190 

20 

ACA GGA GTC ATT GAC CCG GAG ATC CAG OGC TAC AAC ACC CCA GCT TTC 
Thr Gly Val He Asp Pro Glu He Gin Arg Tyr Aen Thr Pro Gly Phe 

1195 1200 1205 

TCA GGC TGC CTG TCT GGT GTT OGA TTC AAC AAC CTG GCT CCC CTC AAG 
25 Ser Gly Cye Leu Ser Gly Val Arg Phe Asn Asn Val Ala Pro Leu Lye 

1210 1215 1220 

ACC CAC TTC CGA ACC CCT CGA CCC ATG ACT GCT GAG CTA GCT GAG GOC 

Thr Hie Phe Arg Thr Pro Arg Pro Met Thr Ala Glu Leu Ala Glu Ala 
1225 1230 1235 

30 

CTT CGA GTT CAG GGA GAA CTG TCC GAA TCT AAT TGC GGA GCT ATG CCA 
Leu Arg Val Gin Gly Glu Leu Ser Glu Ser Asn Cye Gly Ala Met Pro 
1240 1245 1250 



3691 



3739 



3787 



3835 



3979 
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OGT CTT GTT TCA CAO GTO CCA 
Arg I#eu Val Ser Glu Val Pro 
1255 1260 



GAG CTT GAT 
Glu I«eu Aep 
1265 



TGG TAT CTG CCC 
Trp Tyr l»eu Pro 

1270 



5 Pro 



GAC TTC 
Asp Phe 



TAG TAC CAT GAT GAA GGA TGG GTT GCC ATA CTT TTA 
Tyr Tyr Hio Aep Glu Gly Trp Val Ala lie Leu Leu 
1275 1280 1285 



10 



CGC TTT TTG GTG GCC TTT CTG CTG CTG GGG CTG GTG GGA ATG TTG GTG 
Gly Phe Leu Val Ala Phe Leu Leu Lou Gly Leu Val Gly Met Leu Val 

1290 1295 1300 

CTC TTC TAT CTG CAA AAT CAT CGC TAT AAG GGC TCC TAC CAT ACC AAT 
Leu Phe Tyr Leu Gin Aan His Arg Tyr Lye Gly Ser Tyr His Thr Asn 

1305 1310 1315 



GAG CCC AAG GCT GCC CAC GAG TAC CAT 
15 Glu Pro Lya Ala Ala His Glu Tyr His 
1320 



Pro Thr Ser Gly Pro 



Ala Gin Val 
1340 



GGC AGC AAA 
Gly Ser Lys 
1330 



ACC CCT ACA GCA GCT 
Thr Pro Thr Ala Ala 
1345 



AAC 
Asn 

1350 



20 



Gin Ala Pro 



Ala Ser Ala 
1355 



CCA GCC CCA GCC CCA ACT CCA GCC CCA GCC 
Ala Pro Ala Pro Thr Pro Ala Pro Ala 
1360 



25 Pro Gly Pro 



CCC CGG GAT CAG AAC 

Arg Asp Gin Asn 
1370 



CCC CAG ATC 

Pro Gin lie 
1375 



GAG GAG TCC AGG 
Glu Glu Ser Arg 
1380 



GAA T GAGTCAGAAG GGCTTCTGGG 
Glu 



4027 



4075 



4123 



4171 



4219 



4267 



4315 



4363 



ACCAATTCCA GCTCCTGACA TTCCCCCAGT 4420 



30 



CCTGCCTCTC CCCCATCCTA TCAGGGACAT TTOGCTCCTC TTAGCTGGCT CTGCTCATCC 
AGAGGATATT CCCCCATCCC CCCCCCATCA AGTTTGGTGG GCAGAGCTAC AGATGGGACC 



35 



CAAGGGAGTG GCCGAGCCTC 



GCTCCTGGCT GTTTATCTGC 



4480 



4540 



ACTGCCTAAA CCAATGCCCT TCTCATCCCT GTTTCCCCAG 4600 
CCCAAAGGAG AAGCCTCATG GGGTTGACAT AGGTCCTTTC 4660 
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GTTCCAGCTG CTGTCAGOGA TTAACAACAG AGTGTAGCGG AGATTAACTG 4720 



CCTCCXmrCC AATAGACACT ATCAGCACGG ACAGATGTGT GGGAGTGCAG GGCTGCAGAG 4780 

GAGGAGGCTG CTAAACCCTA TCCCCCAGCC TCCCCCCTGC CCTGAAGATC 4840 



10 



15 



TTCCATTTGC TTCCACTCAG CTGGAGGCTC AAGAGGGCTT GATGGCTGTC CCCTGCCCCC 4900 

CTCCTTTTCT TTTGTACACA GAGACCAAGA GGCCTCAGTT TAGCACCTTA GTACCTOC5GC 4960 

TGCTTCACTT GCTTTAGCCA AAGCCATAAA AAACCTGCAA CGTAGAGAAA ATAATGCAGA 5020 

TACCCTGACT AGOCAGCCCT CTACTCCTCC AAOCTTTTCC AAGATATGCA ATGGCCTTTG 5080 

TGCCTGCCCA AAGGCTTOGC CCCCTCCAGT GCATGAGGAA CCCTCTTTCC TCCGCTCAGA 5140 

GATGCTGCTT CATTTACCCA GGAGGTCATA TTCTTTATAT ATATTTTTTG TTCCAAAGTG 5200 

TCTCTCTAGA GAAACTCTAT ATATTATTCG AATTTTTAAA TTATTTGTTT ATATATAAAA 5260 



GAAAAGCTGA ATTGGCAAAA AAAAAAAAAA AAAA 



5294 



20 (2) IHFORMATION FOR SEQ ID NOt2: 

<1) S£QX7ENCE CHARACTERISTICS t 

(A) LENGTH: 1384 amino acids 

(B) TYPES amino acid 

( D ) TOPOLOGY I unknown 



(ii) HOLECULE TYPEt protein 

(xi) SEQUENCE DESCRIPTION t SEQ ID NOs2: 

Met Met Hie Leu Arg Leu Phe Cye lie Leu I*eu Ala Ala Val Ser Gly 
30 1 5 10 15 



Ala Glu Gly Trp Gly Tyr 

20 



Gly Cys Aep Glu Glu Leu Val Gly 
25 30 



Leu Tyr Ala Arg Ser 



Leu Gly Ala 

40 



Ser Tyr Tyr Ser Leu Leu Thr 

45 



Ala Pro Arg Phe Ala Arg Leu His Gly He Ser Gly Trp Ser Pro Arg 
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50 



55 



60 



lie Gly Aap Pro Aon Pro Trp Leu Gin lie Aep Leu Met Lye Lye Hie 
65 70 75 80 

S Arg lie Arg Ala Val Ala Thr Gin Gly Ser Phe Aen Ser Trp Aop Trp 

85 90 



Val Thr Arg Tyr Met Leu Leu Tyr Gly Asp Arg Val Aop Ser Trp Thr 

100 105 110 

10 Pro Phe Tyr Gin Arg Gly Hlo Aon Ser Thr Phe Phe Gly Asn Val Aon 

lis 120 125 



Clu ser Ala Val Val Arg Hlo Aop Leu Hlo Phe Hio Phe Thr Ala Arg 
130 135 140 

Tyr lie Arg lie Val Pro Leu Ala Trp Aon Pro Arg Gly Lyo lie Gly 
145 150 155 160 



Leu Gly Leu Tyr Gly Cyo Pro Tyr Lyo Ala Aop lie Leu Tyr 

170 175 



20 Phe Aop Gly Aop Aop Ala lie Ser Tyr 

180 185 

Arg Ser Leu Trp Aop Val Phe Ala Phe 

200 



Phe Pro Arg Gly Val Ser 

190 

Phe Lye Thr Glu Glu Lyo 
205 



25 Aop Gly Leu Leu Leu Hie Ala Glu Gly Ala Gin Gly Aep Tyr Val Thr 
210 215 220 

Leu Glu Leu Glu Gly Ala Hlo Leu Leu Leu His Met Ser Leu Gly Ser 

230 235 240 

30 ser Pro He Gin Pro Arg Pro Gly Hlo Thr Thr Val ser Ala Gly Gly 

245 250 255 

Val Leu Aon Aop Gin Hie Trp Hio Tyr Val Arg Val Aop Arg Phe Gly 

260 265 270 

35 xrg Aop Val Aon Phe Thr Leu Aop Gly Tyr Val Gin Arg Phe He Leu 

275 280 285 
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Aen Gly Asp Phe Clu Arg Leu Asn Leu Asp Thr Glu Met Phe He Oly 
290 295 300 

Gly Leu Val Gly Ala Ala Arg Lye Aen Leu Ala Tyr Arg Hie Aen Phe 
305 310 315 320 

Arg Gly Cys He Glu Asn Val He Phe Asn Arg Val Asn He Ala Asp 

330 335 



10 



Leu Ala Val Arg Arg His Ser Arg He Thr Phe Clu Gly Lye Val Ala 

340 345 350 

Phe Arg Cys Leu Asp Pro Val Pro His Pro He Asn Phe Gly Gly Pro 

355 360 



His Asn Phe Val Gin Val Pro Gly Phe 
370 



15 



Val Ser Phe Arg Phe Arg Thr Trp Asp 

390 



Arg Arg Gly Arg I<eu Ala 
380 

Thr Gly X^u Leu Leu Phe 

400 



20 



ser Arg Leu Gly Asp Gly Leu Gly His Val Glu 

405 410 

Gly Gin Val Asn Val Ser He Ala Gin Ser Gly 

420 



Thr Leu Ser Olu 
415 

Lys Lys Leu Gin 
430 



25 



Phe Ala Ala Gly Tyr Arg Leu Asn Asp Gly Phe Trp His Glu Val Asn 
435 440 445 

Phe Val Ala Gin Glu Asn Hie Ala Val He Ser He Asp Asp Val Glu 
450 455 460 



30 



Gly Ala Glu Val Arg Val Ser 
465 470 

Ser Tyr Phe Phe Gly Gly Cys 

485 



Pro Leu Leu He 
475 

Lys Pro Ala Ser 
490 



Thr Gly Thr 
480 

Trp Asp Cys 
495 



35 



Hie Ser Asn Gin Thr Ala Phe His Gly Cys Met Glu Leu Leu Lys Val 

500 505 510 

Asp Gly Gin Leu Val Asn I*eu Thr Leu Val Glu Gly Arg Arg Leu Oly 

520 525 
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Phe Tyr Ala Olu Val t-eu Phe Aep Thr Cys Oly lie Thr Asp Arg Cyo 
530 535 540 

Ser Pro Aen Met: Cys Glu Hie Aep Oly Arg Cys Tyr Gin Ser Trp Asp 
545 550 555 560 

5 

Asp Phe lie Cyo Tyr Cys Glu Leu Thr Gly Tyr Lys Gly Glu Thr Cys 

565 570 575 

His Thr Pro Leu Tyr I*ys Glu Ser Cys Glu Ala Tyr Arg I*eu Ser Gly 

580 585 590 

Lys Thr Ser Gly Asn Phe Thr He Asp Pro Asp Gly Ser Gly Pro I*eu 

600 605 



Lys Pro Phe Val Val Tyr Cys Asp He Arg Glu Asn Arg Ala Trp Thr 
610 615 620 

Val Val Arg His Asp Arg Lou Trp Thr Thr Arg Val Thr Gly Ser Ser 

630 635 640 



Met: Glu Arg Pro Phe Leu Oly Ala He Gin Tyr Trp Asn Ala Ser Trp 

645 650 655 

20 

Glu Glu val Ser Ala Leu Ala Asn Ala Ser Gin His Cys Glu Gin Trp 

660 665 670 

He Glu Phe Ser Cys Tyr Asn Ser Arg Leu Leu Asn Thr Ala Gly Oly 
675 680 



Tyr Pro Tyr Ser Phe Trp He Gly Arg Asn Glu Glu Gin His Phe Tyr 
690 695 700 

Trp Gly Gly Ser Gin Pro Gly He Gin Arg Cys Ala Cys Gly Leu Asp 
705 710 715 720 

30 

Arg Ser Cys Val Asp Pro Ala I*eu Tyr Cys Asn Cys Asp Ala Asp Gin 

725 730 735 

Pro Gin Trp Arg Thr Asp Lys Gly Leu Leu Thr Phe Val Asp His Leu 

740 745 750 

35 

Pro Val Thr Gin Val Val He Gly Asp Thr Asn Arg Ser Thr Ser Glu 
755 760 765 
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Ala Gin Phe Phe Leu Arg Pro Leu Arg Cye Tyr Gly Asp Arg Aon Ser 

770 775 780 

Trp Aan Thr He Ser Phe Hia Thr Gly Ala Ala Leu Arg Phe Pro Pro 
785 790 795 800 

5 

He Arg Ala Aen His Ser Leu Asp Val Ser Phe Tyr Phe Arg Thr Ser 

805 810 815 

Ala Pro Ser Gly Val Phe Leu Glu Aen Met Gly Gly Pro Tyr Cye Gin 

820 825 830 

XO 

Trp Arg Arg Pro Tyr Val Arg Val Glu Leu Aen Thr Ser Arg Aep Val 

840 845 



Val Phe Ala Phe Asp Val Gly Aen Gly Asp Glu Aen Leu Thr Val His 
850 855 860 

Ser Asp Asp Phe Glu Phe Aen Asp Asp Glu Trp His I«eu Val Arg Ala 

870 875 880 



Glu Zle Asn Val Lys Gin Ala Arg I^u Arg Val Asp His Arg Pro Trp 

885 890 895 

20 

Val Leu Arg Pro Met Pro Leu Gin Thr Tyr He Trp Met Glu Tyr Asp 

900 905 910 

Gin Pro Leu Tyr Val Gly Ser Ala Glu X«eu Lys Arg Arg Pro Phe Val 
915 920 



Gly Cys Leu Arg Ala Met Arg Leu Asn Gly Val Thr I«eu Asn X#eu Glu 
930 935 940 

Gly Arg Ala Asn Ala Ser Glu Gly Thr Ser Pro Asn Cys Thr Gly His 
945 950 955 960 

30 

Cys Ala His Pro Arg Leu Pro Cys Phe His Gly Gly Arg Cys Val Glu 

965 970 975 

Arg Tyr Ser Tyr Tyr Thr Cys Asp Cys Asp Leu Thr Ala Phe Asp Gly 

980 985 990 

35 

Pro Tyr Cys Asn His Asp He Gly Gly Phe Phe Glu Pro Gly Thr Trp 
995 1000 1005 
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Met Arg Tyr Asn Leu Gin Ser Ala Leu Arg Ser Ala Ala Arg Glu Phe 
1010 1015 1020 

Ser His Met Leu Ser Arg Pro Val Pro Gly Tyr Glu Pro Gly Tyr lie 
1025 1030 1035 1040 

Pro Gly Tyr Asp Thr Pro Gly Tyr Val Pro Gly Tyr His Gly Pro Gly 

1045 1050 1055 

Tyr Arg Leu Pro Asp Tyr Pro Arg Pro Gly Arg Pro Val Pro Gly Tyr 

1060 1065 1070 

Arg Cly Pro Val Tyr Asn Val Thr Gly Glu Glu Val Ser Phe Ser Phe 

1075 1080 1085 



IS 



Ser Thr Ser Ser Ala Pro Ala Val Leu Leu Tyr Val Ser 
1090 1095 1100 

Arg Asp Tyr Met Ala Val Leu He Lys Asp Asp Gly Thr 
1105 1110 1115 



Ser Phe Val 



Gin Leu 
1120 



Arg Tyr Cln Leu Gly Thr Ser 



20 



Val Thr Asp Gly Gin 
1140 



Pro Tyr Val Tyr Cln Leu Thr Thr Arg 
1130 1135 

His Ser He Asn He Thr Arg Val Tyr 
1145 1150 



25 



Asn Leu Phe He Gin Val Asp Tyr Phe 
1155 1160 

Ser t>eu Leu Val Asp Ser Gin I.eu Asp 
1170 1175 



Leu Thr Glu Gin Lys 
1165 

Pro Lys Ala Leu Tyr 
1180 



30 



Leu Cly Arg Val Met Glu Thr Gly Val He Asp Pro Glu He Gin Arg 
1185 1190 1195 1200 

Tyr Asn Thr Pro Gly Phe Ser Gly Cys Leu Ser Gly Val Arg Phe Asn 

1205 1210 1215 

Asn val Ala Pro Leu Lys Thr Hie Phe Arg Thr Pro Arg Pro Met Thr 

1220 1225 1230 

Ala Glu Leu Ala Glu Ala Leu Arg Val Gin Gly Glu Leu Ser Glu Ser 

1235 1240 1245 
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A8n Cyo Gly Ala Met Pro Arg Leu Val Ser Clu Val Pro Pro Olu Leu 
1250 1255 1260 



Aep Pro Trp Tyr I#eu Pro Pro Asp Phe Pro Tyr Tyr His Asp Olu Gly 

1270 1275 1280 



Trp Val Ala lie Leu Leu Gly Phe Leu Val Ala Phe Leu Leu Leu Gly 

1290 1295 



10 



Leu Val Gly Ket Leu Val Leu Phe Tyr Leu Gin Asn His Arg Tyr Lys 

130O 1305 1310 

Gly ser Tyr Hie Thr Asn Glu Pro Lye Ala Ala His Glu Tyr His Pro 
1315 1320 1325 



Gly Ser Lys Pro 
1330 

Pro Thr Ala Ala 
1345 



Pro Leu Pro Thr Ser 



Gly Pro Ala Gin Val 
1340 



Pro Thr 



Asn Gin Ala 
1350 



Pro Ala Ser Ala Pro Ala Pro 



Ala 

1360 



Pro Thr Pro Ala Pro Ala Pro Gly Pro 



Arg Asp Gin Asn Leu 
1370 



Pro Gin 



20 



Leu Glu Glu Ser Arg Ser Glu 
1380 



(2) INFORMATION FOR SEQ ID NO: 3: 



25 



(i) SEQUENCE CBARACTERISTICS t 

(A) LENGTH: 5350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS I single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: cDNA 



35 



<iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus norvegicui 

< ix ) FEATURE : 

(A) NAME/KEY: CDS 
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(B) XiOCATIONt 154.. 4297 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
5 GATTTTOACT GGGGGTA66A GAAAOGGAAG GGTGGGTGAG CACGCAAAAA GCAGCATCOO 
TCROCOGCOA ACCCCAGOAG AAAAGCTOOG GGCCTOAGCC AGAACCGGAG CCCTAGCGGC 



60 



120 



ACAAGGCAOA CXCCCAGGGT 



10 



TGGTCAGCTC CGC ATO ATO AOT CTC COG CTT TTC 

Met Met Ser Leu Arg Leu Phe 

1 5 



174 



AGC ATT CTG CTC 6CC OCT GTO GTC TCT GOA OCC GAG OGC TGG GOC TAC 
ser lie Leu Leu Ala Ala Val Val Ser Gly Ala Gin Gly Trp Gly Tyr 

10 15 20 

15 TAT GGC TCC AAT GAG GAG CTO OTG GGG CCT CTG TAT GCA OGG TCT CTG 
Tyr Gly Cys Asn Glu Glu Leu Val Gly Pro Leu Tyr Ala Arg Ser Leu 

30 



222 



270 



GOT OCT TCC TCC TAC TAT GOA CTC TTT ACC ACA OCC CGC TTT OCC 
Gly Ala Ser Ser Tyr Tyr Gly l*eu Phe Thr Thr Ala Arg Phe Ala 
20 40 45 50 

CTA CAC GOC ATC AOT CGA TCO TOO CCC OGG ATT OGG OAC CCG AAT 
Leu His Gly He Ser Gly Trp Ser Pro Arg He Gly Asp Pro Asn 

60 65 70 



316 



366 



25 TOO CTC CAO ATA OAC TTA ATO AAO AAG CAT OOA ATC OGG OCT CTC OCC 
Trp Leu om He Asp Leu Met Lys Lys His Arg He Arg Ala Val Ala 

75 80 ®^ 



414 



ACA CAO OOA OCC TTT AAT TCT TOO CAT TOG OTC ACA COT TAC ATO CTC 
Thr cm Gly Ala Phe Asn Ser Trp Asp Trp Val Thr Arg Tyr Met Leu 
30 90 



100 



462 



CTC TAC GOG OAC COT GTO OAC AGC TGG ACA CCA TTC TAC CAA CAA GOG 
Leu Tyr Gly Asp Arg Val Asp Ser Trp Thr Pro Phe Tyr Gin Gin Gly 
105 110 115 



510 



35 
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GAC AAC GC6 ACC 
Hie A0n Ala Thr 
120 

CAT GAC CTG CAC 
5 His Asp Leu His 

CTG GCC TGG AAC 
Leu Ala Trp Aan 

155 

10 

CGT TGT CCC TAC 
Gly Cys Pro Tyr 
170 

ATT TCA TAC CGC 
IS He Ser Tyr Arg 
185 

TTC GCT TTT AGT 
Phe Ala Phe Ser 
200 

20 

ACC GAA GGC TCC 
Thr Glu Gly Ser 

CAC CTG CTG CTG 
25 Hie Leu Leu Leu 

235 

CCT GGT CAC ACC 
Pro Gly Hie Thr 
250 

30 

TGG CAT TAT GTG 

Trp His Tyr Val 
265 

CTG GAT GGT TAC 

35 i^u Asp Gly Tyr 
280 



TTC TTC GGT AAT GTC 
Phe Phe Gly Asn Val 

125 

TAC CAT TTT ACG GCT 
Tyr His Phe Thr Ala 
140 

CCC CGC GGC AAG ATT 

Pro Arg Gly Lys lie 

160 

ACG TCC AAC ATC CTG 
Thr Ser Asn lie Leu 

175 

TTC CAG CGA GGG GCC 
Phe Gin Arg Gly Ala 
190 

TTC AAG ACA GAG GAG 
Phe Lys Thr Glu Glu 
205 

CAG GGG GAT TAT GTG 

Gin Gly Asp Tyr Val 
220 

CAC ATG AGC CTG GGC 
His Met: Ser Leu Gly 

240 

ACG GTG AGC GCT GGT 
Thr Val ser Ala Gly 

255 

CGG GTG GAC CGA TAC 

Arg Val Asp Arg Tyr 
270 

GTA CAT CGC TTT GTG 
val His Arg Phe Val 
285 



AAC GAC TCG GCO GTG 
Asn Asp ser Ala Val 
130 

CGC TAC ATC CGC ATC 
Arg Tyr lie Arg lie 
145 

GGC TTG AGG CTG GGC 

Gly Leu Arg Leu Gly 

165 

TAT TTT GAC GGC GAT 
Tyr Phe Asp Gly Asp 

180 

AGT CAA AGT CTT TGG 
Ser Gin Ser Leu Trp 
195 

AAG GAC GGG CTG CTG 
Lys Asp Gly Leu Leu 
210 

ACG CTT GAA CTG CAA 
Thr Leu Glu Leu Gin 
225 

AGC AGC CCC ATC CAG 
Ser Ser Pro lie Gin 

245 

GGC GTA CTT AAT GAC 
Gly Val Leu Asn Asp 

260 

GGC CGA GAA GCA AAT 

Gly Arg Glu Ala Asn 

275 

CTC AAC GGC GAC TTT 

Leu Asn Gly Asp Phe 
290 
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GTA CGC 558 
Val Arg 

135 

GTG CCA 606 

Val Pro 

150 

ATC TAC 654 
lie Tyr 

GAT GCC 702 
Asp Ala 

GAC GTG 750 
Asp Val 

TTG CAC 798 
Leu Els 
215 

GGA GCA 846 
Gly Ala 

230 

CC6 AGA 894 
Pro Arg 

CTA AGC 942 
Leu Ser 

CTC ACC 990 

Leu Thr 

GAA AGG 1038 
Glu Arg 
295 
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CTG AAT CTC GAA AAT OAC ATA TTC ATC GOA GGT CTA GTG GGC OCA GOG 
I^u Asn Leu Glu Asn Glu lie Phe He Gly Gly Leu Val Gly Ala Ala 

300 305 310 



1086 



AAG AAC CTG GCC TAG 

Lye Aen Leu Ala Tyr 
315 



CAT AAC TTC CGT 

His Aen Phe Arg Gly Cys 
320 



ATA GAA AAC 
lie Glu Aen 



1134 



GTG ATC TAC AAC CGG ATC AAC ATA GCT GAA ATG GCA GTG CAG OGC CAT 
Val He Tyr Aon Arg He Asn He Ala Glu Met Ala Val Gin Arg His 
330 335 340 



XO 



TOG OGG ATC AGO TTC GAG GGT AAT GTG GCT TTC OGC TOO TTG GAT CCC 
Ser Arg He Thr Phe Glu Gly Asn Val Ala Phe Arg Cys Leu Asp Pro 
345 350 



1182 



1230 



GTT CCA 
15 Val 

360 



Pro His Pro 



ATC AAC TTC GGA GGC 
He Asn Phe Gly Gly 



GAC AAC TTC GTO CAA GTG 
His Asn Phe Val Gin Val 

370 



1278 



Pro Gly Phe Pro Arg Arg Gly Arg 



CTT GCT GTC TCC 
Leu Ala Val 



20 



AGO TGG GAC 
Thr Trp Asp 



380 

ACA GGG CTG 

Thr Gly Leu 



395 



CTT TTC TCC OGC 
Leu Phe Ser Arg 
400 



Phe Arg Phe Arg 
390 



GGG GAT GGG 
Gly Asp Gly 
405 



25 i^u 



CTG CGT CAT GTG GAG CTG ATG CTT AGT GAA GGG CAA GTC AAT GTA 
Gly His Val Glu Leu Met Leu Ser Glu Gly Gin Val Asn Val 
410 415 420 



ATC GOG CAG ACT GGC 
He Ala Gin Thr Gly 
425 



AAG AAG CTT CAG TTT GCT OOG GGG TAC 
Lys Lys Leu Gin Phe Ala Ala Gly Tyr 
430 435 



30 



CTG AAT GAT GGC TTC TGG CAT GAG GTG AAC TTT GTG GCA CAG GAA AAC 
Leu Asn Asp Gly Phe Trp His Glu Val Asn Phe Val Ala Gin Glu Asn 
440 445 450 455 

CAT 6CG GTC ATC AGT ATT GAT GAT GTG GAG GGG GCA GAG GTC ACG GTA 

35 His Ala Val He Ser He Asp Asp Val Glu Gly Ala Glu Val Arg Val 

460 465 470 



1374 



1422 



1470 



1518 



1566 
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475 

TGT CCC AAA CCA GCC AGT 
5 cya Pro Lys Pro Ala Ser 

490 



ACA GGG ACT 
Thr Gly Thr 
480 



TAC TTC TTT OCT GGT 
Tyr Pho Phe Oly Oly 
485 



TGG GGC TGC CAC TCC AAC CAG ACA OCA 
Trp Gly Cys His Ser Aon Gin Thr Ala 
495 500 



1614 



1662 



10 



TTC CAT GGC TGC ATO GAG CTG CTC AA6 CTG GAT GGT CAA CTG OTC AAC 

Phe His Gly Cys Met Glu Leu Leu Lye Val Asp Gly Gin Leu Val Asn 
505 510 515 

CTC ACT CTG GTA GAG TTT CGG AAG CTT GGT TAC TTT GCT GAG GTC CTC 

Leu Thr Leu Val Glu Phe Arg Lys Leu Gly Tyr Phe Ala Glu Val Leu 

525 530 



1710 



1756 



TTT GAC ACA TGT GGC ATC ACA GAC AGA TGC AGC CCT AAT ATC TGT GAG 
15 Phe Asp Thr Cys Gly lie Thr Asp Arg Cys Ser Pro Asn Met Cys Glu 

540 



1806 



CAT GAT GGG OGC TGC TAC CAG TCT TGG GAT GAC TTC ATC TGC TAC TCC 
Hla Asp Gly Arg Cys Tyr Gin Ser Trp Asp Asp Phe lie Cys Tyr Cys 

560 



20 



OAA CTC ACC GGC TAC AAG OCA GTT ACC TGC CAT GAA CCA TTG TAT AAG 
Glu Leu Thr Gly Tyr Lys Gly Val Thr Cys His Glu Pro Leu Tyr Lys 
570 575 580 



1854 



1902 



GAG TCC TGT GAA GCC TAT CGC CTC AGC GGG AAA TAT TCT GGA AAT TAC 
25 Glu Ser Cys Glu Ala Tyr Arg Leu Ser Gly Lys Tyr Ser Gly Asn Tyr 

590 



1950 



30 



ACC ATT GAT 

Thr He Asp 
600 



TGT GAT ATC OGA 



GAT GGC AGT GGA 
Asp Oly Ser Gly 
605 

GAG AAC COA GCG 
Glu Asn Arg Ala 
620 



CTG AAA CCA TTT GTA GTG TAT 
Leu Lys Pro Phe Val Val Tyr 
610 



1998 



ACA GTT GTG COA CAT GAC AGO 
Thr Val Val Arg His Asp Arg 
625 630 



2046 
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CTA TGG ACC ACT 

Leu Trp Thr Thr 

635 



GTO ACA GOT TCG AGC ATG GAC 
Val Thr Gly Ser Ser Met Asp 

640 



CCC TTT CTG 
Pro Phe Leu 
645 



2094 



GOG OCC ATC CAA TAC TCG AAT GCC TCC TOG GAG GAA GTC AGT OCT CTG 
5 Gly Ala lie Gin Tyr Trp Asn Ala Ser Trp Glu Glu Val Ser Ala Leu 

650 655 660 

OCC AAT OCT TCC CAG CAC TOT GAG CAG TGG ATC GAG TTC TCC TGC TAC 
Ala Aon Ala Ser Gin His Cyo Glu Gin Trp He Glu Phe Ser Cye Tyr 
665 670 

10 

AAT TCC COO CTG CTC AAC ACT OCA GOA GOC TAC CCC TAC AGC TTT TGG 
Aen ser Arg Leu Leu Aan Thr Ala Gly Gly Tyr Pro Tyr Ser Phe Trp 
680 685 690 695 

ATT OOC OOA AAT CAA GAA CAG CAT TTC TAC TCG GOA OGC TCC CAG CCT 
15 He Gly Arg Aon Glu Olu Gin Hie Phe Tyr Trp Gly Oly Ser Gin Pro 

700 705 710 

GOO ATC CAA OGC TOT GCC TOT GOG CTG GAC CAG AGC TGT ATA GAC CCT 
Oly He Gin Arg Cye Ala Cya Oly Leu Asp Gin Ser Cys He Asp Pro 

715 720 725 



20 



CCA CTO CAC TGC AAC TGC OAT OCT GAC CAG CCA CAG TOG ACA ACA GAC 
Ala Leu His Cys Asn Cys Asp Ala Asp Gin Pro Oln Trp Arg Thr Asp 
730 735 740 



AAG GOG CTC CTG ACC TTT GTO GAC CAT CTG 
25 Lys Gly Leu Leu Thr Phe Val Asp His Leu 
745 750 



GTC ACT CAG OTA GTG 
Val Thr Gin Val Val 

755 



30 



ATA OOT GAC ACA AAC OCC TCC AGC TCT GAA OCT CAG TTC TTC CTO AGO 
He Oly Asp Thr Asn Arg Ser Ser Ser Olu Ala Gin Phe Phe Leu Arg 
760 765 770 775 

CCT CTG CCC TOT TAT GOT GAC COC AAT TCC TOG AAC ACT ATC TCC TTC 
Pro Leu Arg Cys Tyr Gly Asp Arg Asn Ser Trp Asn Thr He Ser Phe 

780 785 790 



OGC ACT GGA GCT OCA CTO CGT TTC CCT CCA ATC COT OCC AAC CAC ACC 
35 Arg Thr Gly Ala Ala Leu Arg Phe Pro Pro He Arg Ala Asn His Ser 

800 SOS 



2142 



2190 



2286 



2430 



2478 



2574 
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CTT GAT GTC 
Leu Asp Val 
810 



TTC TAC TTC AGG ACC 

Phe Tyr Phe Arg Thr 

815 



Ser Ala Pro 



TCA GGA GTC TTC 

Ser Gly Val Phe 
820 



CTA GAG AAC ATG GGG GGT COT TTC TGC CAG TGG COC CGA CCT TAC GTG 
5 Leu Glu Aon Met Gly Gly Pro Phe Cye Gin Trp Arg Arg Pro Tyr Val 
B25 830 



10 



AGA GTG GAG CTC AAC ACA TCC OGG GAT GTG GTC TTT GCC TTT GAT ATT 
Arg Val Glu X*eu Asn Thr Ser Arg Aop Val Val Phe Ala Phe Aap lie 
840 845 850 855 

GGC AAT OGG GAT GAG AAC CTG ACA GTG CAC TCA OAT OAC TTC GAG TTC 
Gly Aan Gly Aap Glu Aen X*eu Thr Val Hta Ser Aap Aap Phe Glu Phe 

860 865 870 



2670 



2718 



2766 



AAT OAT GAC GAG TGG CAT TTG GTC OGG OCT GAA ATC AAC GTG AAG CAG 

15 Aan Aap Aap Glu Trp Hie Leu Val Arg Ala Glu lie Aan Val Lya Gin 

880 885 



2814 



GCC CCG CTG CGA GTG GAC CAT COG CCC TGG GTG CTA AGG CCC ATG CCC 
Ala Arg Leu Arg Val Aap Hia Arg Pro Trp Val Leu Arg Pro Met Pro 
890 895 900 



20 



CTG CAG AC6 TAC ATC TGG CTG OAG TAT OAC CAA CCC CTC TAT GTG GGA 
Leu Gin Thr Tyr He Trp Leu Glu Tyr Aap Gin Pro Leu Tyr Val Gly 
905 910 915 



29X0 



TCT OCA GAG CTT AAG AGG 
25 Ser Ala Glu Leu Lya Arg 
920 925 

CGT TTG AAT GGA GTG ACT 

Arg Leu Aan Gly Val Thr 

940 



CCA TTT GTG GGG TGC TTG AGG GCC ATG 
Pro Phe Val Gly Cye Leu Arg Ala Met 

930 



2958 



30 



GAG GGC ACC TTC CCC AAC 
Glu Gly Thr Phe Pro Aan 

955 



AAC TTG OAG GGT COT GCC AAT OCC TCC 

Aan Leu Glu Gly Arg Ala Aan Ala Ser 
945 950 

ACG GGC CAC TGC ACC CAC OCC COG TTC 

Thr Gly Hia Cya Thr Hla Pro Arg Phe 
960 965 



3006 



3054 



CCC TOT TTC CAC GGA GGA CGC TOT GTG CAG CGA TAC AGC TAC TAC ACG 
35 Pro Cya Phe Hta Gly Gly Arg Cya Val Glu Arg Tyr Ser Tyr Tyr Thr 

970 975 980 



3102 
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TGT OAC TGT OAC CTC ACA OCT TTT GAT GOA CCA TAT TGT AAT CAC OAT 3150 
eye ABp Cya Asp Leu Thr Ala Phe Aep Oly Pro Tyr Cyo Asn Hie Asp 

990 995 



OCA GTC CTG CTC TAC OTC AGC TCC TTT OTO COT OAC TAC ATG GCC OTG 
25 Ala Val Leu Leu Tyr Val Ser Ser Phe Val Arg Aep Tyr Met Ala Val 

1100 1105 1110 

CTC ATC AAG OAA GAT GCC ACC CTA CAG CTT OGC TAT CAG CTG GOC ACC 
Leu lie Lye Olu Aep Oly Thr Leu Gin Leu Arg Tyr Gin Leu Gly Thr 

1115 1120 1125 

30 

ACT CCC TAT OTG TAC CAG CTA ACC ACC COG CCA OTG ACC GAT GOC CAA 
ser Pro Tyr Val Tyr Gin Leu Thr Thr Arg Pro Val Thr Aep Gly Gin 
1130 1135 11*0 



V 



ATT GOT GGA TTC TTT GAG ACT GGC ACA TOO ATG CGC TAT AAC CTC CAG 3198 
5 He Gly Gly Phe Phe Glu Thr Gly Thr Trp Met Arg Tyr Aon Leu Gin 
1000 1005 1010 1015 

TCA OCA CTG COT TCT GCG GCC CAG GAG TTC TCT CAC ATG CTG AGC COG 3246 
ser Ala Leu Arg Ser Ala Ala Gin Glu Phe Ser Hie Met Leu Ser Arg 

1020 1025 1030 

10 

ceo OTA CCG GGC TAT GAG CCT GOC TAT ATC CCA OGC TAC OAC ACT CCT 3294 
Pro Val Pro Gly Tyr Glu Pro Oly Tyr He Pro Oly Tyr Aep Thr Pro 

1035 1040 1045 

GOT TAC OTG CCT GOG TAC CAT GOT CCT GOG TAC CGC CTA CCC OAC TAC 3342 
15 Gly Tyr Val Pro Oly Tyr Hie Oly Pro Gly Tyr Arg Leu Pro Asp Tyr 

1050 1055 1060 

CCA AGO CCT GGC OGO CCA OTG CCC GGA TAC COG GOG CCC OTO TAC AAT 3390 

Pro Arg Pro Oly Arg Pro Val Pro Oly Tyr Arg Oly Pro Val Tyr Asn 

1065 1070 1075 

20 

OTT ACT GGA GAG OAG OTC TCC TTT AGC TTC AGC ACC AGC TCT OCT CCT 
Val Thr Gly Olu Olu Val Ser Phe Ser Phe Ser Thr Ser Ser Ala Pro 
1080 1085 1O90 1095 



3438 



3486 



3534 



3582 
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CCC CAT AGT GTC AAC ATC ACC OGG GTC TAC 
Pro Hie ser Val Asn lie Thr Arg Val Tyr 
1145 1150 



AAC CTC TTT ATC CAC 
Asn Leu Phe lie Gin 
1155 



CTG GAG TAC TTC CCG CTG ACA GAA CAG AAG TTC TCT CTC CTG GTG GAC 
5 Val Aop Tyr Phe Pro I^u Thr Glu Gin Lye Phe Ser I*eu Leu Val Asp 
1160 1165 1170 1175 

AGC CAG CTG GAC TCC CCC AAG GCC TTG TAT CTA GGG CGT GTG ATG GAG 
Ser Gin Leu Aep Ser Pro Lye Ala Leu Tyr Leu Gly Arg Val Met Glu 

1180 1185 1190 



10 



ACA GGA GTC ATT GAC CCA GAG ATT CAG GGG TAC AAC ACC CCA GGT TTC 
Thr Gly Val He Aep Pro Glu He Gin Arg Tyr Asn Thr Pro Gly Phe 

1195 1200 1205 



3630 



3678 



3726 



3774 



TCA CCC TGC CTG TCT GOT CTC OGG TTC AAC AAT GTG OCT CCT CTC AAG 
15 Ser Gly Cye Leu Ser Gly Val Arg Phe Asn Asn Val Ala Pro Leu Lys 

1210 1215 1220 



3822 



ACC CAT TTC CGA ACC CCT OGC OCC ATG ACT OCT GAG CTG GCG GAG GCC 
Thr HtB Phe Arg Thr Pro Arg Pro Met Thr Ala Glu I^u Ala Glu 
1225 1230 



3870 



20 

ATG OGG GTT CAA GGA GAA CTG TCG GAG TCT AAC TGT OGC OCT ATG OCA 
Met Arg Val Gin Gly Glu Leu Ser Glu Ser Asn Cys Gly Ala Met Pro 
1240 1245 1250 125! 

CGC CTT GTC TCC GAG GTG CCA CCA GAG CTT GAT CCC TOG TAC CTG OCC 
25 Arg Leu Val Ser Glu Val Pro Pro Glu Leu Asp Pro Trp Tyr Leu Pro 

1260 1265 1270 



3918 



3966 



30 



CCA OAT TTC CCA TAC TAC CAT GAC GAC GGA 

Pro Asp Phe Pro Tyr Tyr His Asp Aep Gly 

1275 1280 

GGT TTT TTG GTG OCC TTC CTG CTG CTG GGG 
Gly Phe Leu Val Ala Phe Leu Leu Leu Gly 
1290 1295 



ATT GCC ATA CTC TTA 
lie Ala He Leu Leu 
1285 

GTG GGA ATG CTG GTG 
Val Gly Met Leu Val 
1300 



4014 



4062 



CTG TTC TAT CTG CAA AAT CAT CGA TAC AAG OGC TCC TAT CAC ACC AAC 
35 Leu Phe Tyr Leu Gin Asn His Arg Tyr Lys Gly Ser Tyr His Thr Asn 
1305 1310 1315 



4110 
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GAG CCC AAG GCC ACC CAT GAT TCC CAC OCT GGC GGC AAA GOT CCC CTA 4156 
Glu Pro Lye Ala Thr His Aep Ser His Pro Gly Cly Lye Ala Pro Leu 
1320 1325 1330 1335 

CCT CCC TCA GCC CCT GCC CAG GCC CCT GCC CCC ACT CCA GCT CCC ACC 4206 
5 Pro Pro Ser Gly Pro Ala Gin Ala Pro Ala Pro Thr Pro Ala Pro Thr 

1340 1345 1350 

CAG GTT CC6 ACC CCA GCC CCA GCC CCA GCC TCT GGC CCA GGC CCC AGG 4254 

Gin Val Pro Thr Pro Ala Pro Ala Pro Ala Ser Gly Pro Gly Pro Arg 

1355 1360 1365 

XO 

GAC CAG AAC CTC CCC CAG ATC TTG GAG GAG TCC AGG TCT GAA T 4297 
Aep Gin Asn Leu Pro Gin lie Leu Glu Glu Ser Arg Ser Glu 
1370 1375 1360 

OAGTCACAA6 GGCTTCAGGG ACCAAGGCCA ACTCCTCTAA GTCCCTTCAG CTCCTGCCTC 4357 

15 

TCCTCTCCCC TGTCAGGGAC A TTT G GCTCT TCTTAGCAGG CTCTGTTCAC CAGGAGGATC 4417 

CCCTCTTGCC AAGTTTGGTG T6CAGA6CTA CAGATGGGAC CAAAGGGAGT GGCCGAGTCT 4477 

CACTGCCTAA ACCAATGCCC TGCCCCCACC CCCCACCCCA GCTCCTGGCT GTTTGCCTGC 4537 

20 

CCTACGGGAG AAAGCTCATO GAGCT6AGGC OGOCCTTTCC TGCCATCTCT GTCCCAOCTG 4597 

CTGGCAAGGA TTAACAACCA AGGGCAGGGG AGGTGAACTG CCTCCCTTCC TGTGGGTATT 4657 

ATCAGCAOGG ACAGATGTGG CGGATOGAGG GGCTGCACAG GGCAGGCAGG GAGGGAGGGA 4717 

25 

GGAGGCTGCT AAAACACACC CTGGAGCCTC CCCCCTGCCC TGCTGACCGG CTGTCTTCCA 4777 

TCTGCTTCCT CTCAGCTOGG GTTGAGGGAA GAACTTCATC COCACCCCCC ACCTCACCCA 4637 

ACCCTTTTTG TTCTTACAGA GACCAAGAGG CCTCAGCTTA GCACTTTAGT ACCTCCACTG 4697 

30 

CTTCACATGC TTTAGCCAAA GCCATAAAAA GCCTGCAA6T AGAAGAAATA ATGCAGACCC 4957 

TGCCCAGCCA GTCCTCTGCT CCTCCACCCC TTTCCAAAAT ACGCAATAGC CTGGGGTGCC 5017 

TGTGTGCAGG CCTGGCCCCT GCGTGCATGA GGAGCCCCTC CCGCTCAGAG ATGCTGCGAG 5077 

35 

TTCCTCCAGG AGGTCATATT CTTTATATAT ATTTTTTGTT GCAAAGCCTC TCTCTAGAGA 5137 
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5197 



ACTATATATT ACTCTAATTT TTTAATTATC TGTTTATATA TAAAACAACT CAGTGGGCOC 
TCCTGTGCTG TGCCCAGTTT GTAGTGAGCT CCTTCTGTTG GATGTCTCAT GAGTCCTGCC 5257 
AGCCACTCAC COGCCTGCCX; GGCCTCCATT CTAGAGCAGG CAGAGCCCGC TGTGCCCTCA 5317 



CCTGAGCAGG TTCAATAAAA GCAGAGTGGC A6A 



5350 



(2) IHPORMATION FOR SEQ ZD NOt4: 

10 <1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1381 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



<ii) MOLECULE TYPE: protein 

15 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs4l 

Met Met Ser Leu Arg Leu Phe Ser He Leu Leu Ala Ala Val Val 
15 10 15 



20 Gly Ala Gin Gly Trp Gly Tyr Tyr Gly Cya Aon Clu Glu Leu Val Gly 

20 25 30 

Pro I*eu Tyr Ala Arg Ser Leu Gly Ala Ser Ser Tyr Tyr Gly I*eu Phe 

35 40 45 



Thr Thr Ala Arg Phe Ala 
50 



Leu His Gly He Ser Gly Trp Ser Pro 

60 



Arg He Gly Asp Pro Aen 

70 



Trp Leu Gin He Asp Leu Met Lye Lys 

75 80 



30 Hio Arg He Arg Ala Val Ala Thr Gin Gly Ala Phe Asn Ser Trp Asp 

85 90 95 

Trp Val Thr Arg Tyr Met Leu Leu Tyr Gly Asp Arg Val Asp Ser Trp 

100 105 110 

35 Thr Pro Phe Tyr Gin Gin Gly Hie Aen Ala Thr Phe Phe Gly Asn Val 

115 120 125 
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Aan ABp Ser Ala Val Val Arg Hia Aep Leu Hie Tyr Hie Phe Thr Ala 
130 135 1*0 

Arg Tyr He Arg He Val Pro Leu Ala Trp Asn Pro Arg Oly Lye He 
145 150 155 160 

5 

Gly Leu Arg Leu Gly He Tyr Oly Cye Pro Tyr Thr Ser Asn He Leu 

170 175 

Tyr Phe Aep Cly Aep Asp Ala He Ser Tyr Arg Phe Gin Arg Oly Ala 

180 185 190 

10 

ser Oln Ser Leu Trp Asp Val Phe Ala Phe Ser Phe Lye Thr Olu Olu 
195 200 205 

Lys Asp Oly Leu Leu Leu Hie Thr olu Oly Ser Gin Gly Asp Tyr val 
210 215 220 

15 

Thr Leu Olu Leu Oln Oly Ala His Leu Leu Leu His Met Ser Leu Oly 
225 230 235 240 

ser Pro He Gin Pro Arg Pro Cly His Thr Thr Val Ser Ala Oly 

250 255 



20 

Cly val Leu Asn Asp Leu Ser Trp His Tyr Val Arg Val Asp Arg Tyr 

260 265 270 

Gly Arg Olu Ala Asn Leu Thr Leu Asp Oly Tyr Val His Arg Phe Val 
275 280 285 

25 

Leu Asn Cly Asp Phe Olu Arg Leu Asn Leu Olu Asn Olu He Phe He 
290 295 300 

Cly Gly Leu Val Gly Ala Ala Arg Lys Asn Leu Ala Tyr Arg His Asn 
305 310 315 320 

30 

Phe Arg Oly Cys He Olu Aen Val He Tyr Asn Arg He Asn He Ala 

325 330 335 

Olu Met Ala val Oln Arg Hie Ser Arg He Thr Phe Olu Gly Asn Val 

340 345 350 

35 

Ala Phe Arg Cys Leu Asp Pro Val Pro His Pro He Asn Phe Oly Oly 
355 



360 365 
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« 

Pro Hia Asn Phe Val Gin VaL Pro Gly Phe Pro Arg Arg Cly Arg Leu 
370 375 380 

Ala Val Ser Phe Arg Phe Arg Thr Trp Aep Leu Thr Gly Leu Leu Leu 
385 390 395 400 

5 

Phe ser Arg Leu Gly Aep Gly Leu Gly Hifl Val Glu Leu Met Leu Ser 

405 410 415 

Glu Gly Gin Val Asn Val Ser lie Ala Gin Thr Gly Arg Lye Lys Leu 

420 425 430 

10 

Gin Phe Ala Ala Gly Tyr Arg Leu Asn Asp Gly Phe Trp His Glu Val 
435 440 445 

Asn Phe Val Ala Gin Glu Asn His Ala Val lie Ser lie Asp Asp Val 
450 455 460 

15 

Glu Gly Ala Glu Val Arg Val Ser Tyr Pro Leu Leu lie Arg Thr Gly 
465 470 475 480 

Thr Ser Tyr Phe Phe Gly Gly Cys Pro Lys Pro Ala Ser Arg Trp Gly 

485 490 495 

20 

Cys His Ser Asn Gin Thr Ala Phe His Gly Cys Met Glu Leu Leu Lys 

500 505 510 

Val Asp Gly Gin Leu Val Asn Leu Thr Leu Val Glu Phe Arg Lys Leu 
515 520 525 

25 

Gly Tyr Phe Ala Glu Val Leu Phe Asp Thr Cys Gly lie Thr Asp Arg 
530 535 540 

Cys Ser Pro Asn Met Cys Glu Hie Asp Gly Arg Cys Tyr Gin Ser Trp 
545 550 555 560 

30 

Asp Asp Phe lie Cys Tyr Cys Glu Leu Thr Gly Tyr Lys Gly Val Thr 

570 575 



Cys His Glu Pro Leu Tyr Lys Glu Ser Cys Glu Ala Tyr Arg Leu Ser 

580 585 590 

35 

Gly Lys Tyr Ser Gly Asn Tyr Thr lie Asp Pro Asp Gly Ser Gly Pro 

600 605 
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Leu LyB Pro Phe Val Val Tyr cys Asp lie Arg Glu Ken Arg Ala Trp 

615 620 



610 

Thr Val Val 
625 



8er Me^ Aep Arg 



HlB ABp Arg Leu Trp Thr Thr Arg Val Thr Gly Ser 
630 635 640 

Pro Phe Leu Oly Ala lie Gin Tyr Trp Aen Ala Ser 
645 650 



10 



Glu Glu Val 
660 

lie Glu Phe 
675 



Ser Ala Leu Ala Asn Ala Ser 



Gin Hia eye Glu Gin 
670 



Cya Tyr Asn Ser Arg 
680 



Leu I.eu Asn Thr Ala Gly 



Gly Tyr Pro Tyr Ser Phe Trp lie Gly Arg Aan Glu Glu Gin His Phe 

700 



15 



20 



30 



690 



Tyr Trp Gly Gly ser Gin Pro Gly He Gin Arg Cye Ala Cys Gly Leu 
705 710 715 720 

Asp Gin ser Cya He Asp Pro Ala Leu His Cys Asn Cys Asp Ala Asp 

725 730 ^735 

Gin Pro Gin Trp Arg Thr Asp Lys Gly Leu Leu Thr Phe Val Asp His 

740 745 750 

Leu Pro Val Thr Gin Val Val He Gly Asp Thr Asn Arg Ser Ser Ser 

755 760 765 

Glu Ala Gin Phe Phe Leu Arg 

770 775 

Ser Trp Asn Thr He Ser Phe 
785 790 

Pro He Arg Ala Asn His Ser 

805 



Leu Arg Cys Tyr Gly Asp Arg Asn 

780 

Thr Gly Ala Ala Leu Arg Phe Pro 
795 800 

Asp Val Ser Phe Tyr Phe Arg Thr 
810 



ser Ala Pro Ser Gly Val Phe Leu Glu Asn Met Gly Gly Pro Phe Cys 

820 625 630 

Gin Trp Arg Arg Pro Tyr Val Arg Val Glu Leu Asn Thr Ser Arg Asp 

835 640 
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Val Val Phe Ala Phe Aep lie Gly Asn Gly Asp Glu Aon Leu Thr Val 
850 855 860 

HlB Ser Aep Aep Phe Glu Phe Asn Asp Asp Glu Trp His I*eu Val Arg 
865 870 875 880 

5 

Ala Glu He Asn Val Lys Gin Ala Arg Leu Arg Val Asp His Arg Pro 

885 890 



Trp Val Leu Arg Pro Met Pro I#eu Gin Thr Tyr He Trp Leu Glu Tyr 

900 905 910 

XO 

Asp Gin Pro Leu Tyr Val Gly Ser Ala Glu Leu Lys Arg Arg Pro Phe 
915 920 



Val Gly Cys Leu Arg Ala Met Arg Leu Asn Gly Val Thr Leu Asn Leu 
930 935 940 

15 

Glu Gly Arg Ala Asn Ala Ser Glu Gly Thr Phe Pro Asn Cys Thr Gly 
945 950 955 960 

His Cys Thr His Pro Arg Phe Pro Cys Phe His Gly Gly Arg Cys Val 

965 970 975 

20 

Glu Arg Tyr Ser Tyr Tyr Thr Cys Asp Cys Asp Leu Thr Ala Phe Asp 

980 985 990 

Gly Pro Tyr Cys Asn His Asp He Gly Gly Phe Phe Glu Thr Gly Thr 
995 1000 1005 

25 

Trp Met Arg Tyr Asn Leu Gin Ser Ala Leu Arg Ser Ala Ala Gin Glu 
1010 1015 1020 

Phe Ser His Met Leu ser Arg Pro Val Pro Gly Tyr Glu Pro Gly Tyr 
1025 1030 1035 1040 

30 

He Pro Gly Tyr Asp Thr Pro Gly Tyr Val Pro Gly Tyr His Gly Pro 

1045 1050 1055 

Gly Tyr Arg Leu Pro Asp Tyr Pro Arg Pro Gly Arg Pro Val Pro Gly 

1060 1065 1070 

Tyr Arg Gly Pro Val Tyr Asn Val Thr Gly Glu Glu Val Ser Phe Ser 
1075 1080 1085 
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Phe Ser Thr Ser 
1090 



Ala Pro Ala Val Leu Leu Tyr Val Ser 
1095 1100 



Ser Phe 



Val Arg Asp Tyr Met Ala Val Leu lie Lye Glu Asp Gly Thr Leu Gin 
1105 1110 1115 1120 



10 



Leu Arg Tyr Gin Leu Gly Thr Ser Pro Tyr Val Tyr Gin Leu Thr Thr 

1125 1130 1135 

Arg Pro Val Thr Aep Gly Gin Pro Hia Ser Val Aen lie Thr Arg Val 

1140 1145 1150 

Tyr Arg Asn Leu Phe lie Gin Val Aep Tyr Phe Pro Leu Thr Glu Gin 
1155 1160 1165 



Lye Phe Ser Leu Leu Val Asp Ser Gin Leu Asp Ser Pro Lys Ala Leu 
1170 1175 1180 

Tyr Leu Gly Arg Val Met Glu Thr Gly Val lie Asp Pro Glu He Gin 



1185 



1190 



20 



Arg Tyr Aon Thr Pro Gly Phe Ser Gly Cys 

1205 1210 

Asn Aon Val Ala Pro Leu Lys Thr His Phe 

1220 1225 



1200 



Leu Ser Gly Val Arg Phe 



Thr Pro Arg Pro Met 
1230 



Thr Ala Glu Leu Ala Glu Ala Met Arg Val Gin Gly Glu Leu 

1240 1245 



Ser Glu 



25 



Aon Cys Gly Ala Met Pro Arg Leu Val Ser Glu Val Pro 

1260 



Pro Glu 



Leu Asp Pro Trp Tyr Leu Pro Pro Asp Phe Pro Tyr Tyr Hie Asp Asp 

1275 1280 



1265 



1270 



30 



35 



Gly Trp He Ala He Leu Leu Gly Phe Leu Val Ala Phe Leu 

1285 1290 1295 



Gly Leu Val Gly Met Leu Val Leu Phe Tyr Leu Gin Asn Hto Arg Tyr 

1300 1305 1310 

Lys Gly Ser Tyr His Thr Asn Glu Pro Lys Ala Thr His Asp Ser His 
1315 1320 1325 
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Pro Oly Gly Lyo Ala Pro l^u Pro Pro Ser Gly Pro Ala Gin Ala Pro 
1330 1335 1340 

Ala Pro Thr Pro Ala Pro Thr Gin Val Pro Thr Pro Ala Pro Ala Pro 
1345 1350 1355 1360 

5 

Ala ser Gly Pro Gly Pro Arg Aep Gin Asn Leu Pro Gin lie Leu Glu 

1370 1375 



Glu ser Arg Ser Glu 

1380 

10 

The present invention is not to be limited in scope by 
the exemplified embodiments which are intended as 
illustration of single aspects of the invention, and any 

xs clones, DNA or amino acid sequences which are functional 
equivalent are within the scope of the invention. Indeed, 
various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the 
art from the foregoing description and accompanying drawings. 

20 Such modifications are intended to fall within the scope of 
the appended claims. 

All references cited herein are hereby incorporated by 

reference in their entirety. 



25 



30 
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^ftT TS CIAIMED IS: 

1. An isolated nucleic acid molecule having a 
nucleotide sequence that: 
5 (a) encodes the amino acid secjfuence SEQ ID NO: 2; 

(b) encodes the amino acid seqpience SEQ ID NO: 4; or 

(c) is the complement of the nucleotide sequence of (a) 
or (b) . 

3^0 2. An isolated nucleic acid molecule having a 

nucleotide sequence that hybridizes under highly stringent 
conditions to the nucleic acid molecule of Claim 1, and 
encodes a naturally occurring pl90 polypeptide. 

15 3* A nucleic acid molecule which comprises (a) a 

nucleotide sequence that encodes a polypeptide having the 
amino acid sequence shown in SEQ ID NO: 2 from amino acid 

residues 40-168, 199-330, 362-486, 544-576, 582-739, 809-938, 
961-985, 1031-1077, 1083-1218, 1282-1306, or 1328-1369; or 

20 (b) the complement of the nucleotide sec[uence of (a) . 

4. A nucleic acid molecule which comprises (a) a 
nucleotide sequence that encodes a pl90 polypeptide lacking 
at least one domain which has an amino acid sequence shown in 
25 SEQ ID NO:2 from amino acid residues 40-168, 199-330, 362- 

486, 544-576, 582-739, 809-938, 961-985, 1031-1077, 1083- 

1218, 1282-1306, or 1328-1369; or (b) the complement of the 
nucleotide sequence of (a) . 

3Q 5. A recombinant vector containing the nucleic acid 

molecule of claims 1 , 2 , 3 , or 4 - 



6. The 
nucleic acid mo 
35 element that 



vector of claim 5 wherein the 
is operatively associated with an 
the expression of the nucleic acid 



molecule in a host cell 
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7. An engineered host cell containing the nucleic acid 
molecule of Claim 1, 2, 3, or 4 . 

8. An engineered host cell containing the nucleic acid 
5 molecule of Claim 1, 2, 3, or 4 operatively associated with 

an element that controls the expressing of the nucleic acid 
molecule by the engineered host cell. 

9. The engineered host cell of Claim 8 which is 
10 eukaryotic. 

10. The engineered host cell of Claim 8 which is 



15 11. An isolated polypeptide comprising an amino acid 

encoded by the nucleic acid molecule of Claim 1, 2, 

3 or 4. 

12. The isolated polypeptide of Claim 11, wherein the 
20 polypeptide comprises the amino acid sequence of SEQ ID NO: 2 

or SEQ ID NO: 4. 

13. A method for preparing a pl90 polypeptide 

(a) culturing a eulcaryotic host cell which 
contains the nucleotide sequence of Claim 1, 
2, 3, or 4 operatively associated with an 
element that controls expression of the DNA 
sequence so that a pl90 polypeptide is 
expressed by the host cell; and 

(b) recovering the pl90 polypeptide from the 



30 



14. A method for preparing a pl90 polypeptide 

35 comprising: 

(a) culturing a prokaryotic host cell which 

contains the nucleotide sequence of Claim 1, 
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2, 2, or 4 operatively associated with an 
element that controls expression of the DNA 
sequence so that a pl90 polypeptide is 
expressed by the host cell; and 
(b) recovering the pl90 polypeptide from the 



15, The isolated nucleic acid molecule of Claim 1 
wherein the nucleotide sequence comprises the nucleotide 
10 sequence of SEQ ID NO:l or SEQ ID NO: 3. 



20 



25 



30 



35 
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